Dataset statistics
| Number of variables | 65 |
|---|---|
| Number of observations | 4272 |
| Missing cells | 174608 |
| Missing cells (%) | 62.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.1 MiB |
| Average record size in memory | 520.0 B |
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 58 |
repeat_instrument_1 has constant value "" | Constant |
repeat_instrument_2 has constant value "" | Constant |
repeat_instance_1 has constant value "" | Constant |
repeat_instance_2 has constant value "" | Constant |
local_de_recidiva_a_distancia_metastase_4_cid_o_topografia_2 has constant value "" | Constant |
data_da_primeira_consulta_institucional_dt_pci_1 has a high cardinality: 2463 distinct values | High cardinality |
data_da_primeira_consulta_institucional_dt_pci_2 has a high cardinality: 349 distinct values | High cardinality |
data_do_diagnostico_1 has a high cardinality: 2460 distinct values | High cardinality |
data_do_diagnostico_2 has a high cardinality: 358 distinct values | High cardinality |
codigo_da_topografia_cid_o_2 has a high cardinality: 72 distinct values | High cardinality |
data_do_tratamento_1 has a high cardinality: 2405 distinct values | High cardinality |
data_do_tratamento_2 has a high cardinality: 322 distinct values | High cardinality |
data_de_recidiva_1 has a high cardinality: 1021 distinct values | High cardinality |
descricao_da_morfologia_de_acordo_com_cid_o_2 has a high cardinality: 73 distinct values | High cardinality |
descricao_da_topografia_2 has a high cardinality: 72 distinct values | High cardinality |
classificacao_tnm_clinico_n_2 is highly imbalanced (54.4%) | Imbalance |
classificacao_tnm_clinico_m_1 is highly imbalanced (70.8%) | Imbalance |
classificacao_tnm_clinico_m_2 is highly imbalanced (65.0%) | Imbalance |
descricao_da_morfologia_de_acordo_com_cid_o_1 is highly imbalanced (82.6%) | Imbalance |
com_recidiva_a_distancia_2 is highly imbalanced (50.5%) | Imbalance |
com_recidiva_regional_1 is highly imbalanced (66.0%) | Imbalance |
com_recidiva_regional_2 is highly imbalanced (80.7%) | Imbalance |
com_recidiva_local_1 is highly imbalanced (60.7%) | Imbalance |
com_recidiva_local_2 is highly imbalanced (64.3%) | Imbalance |
repeat_instrument_2 has 3903 (91.4%) missing values | Missing |
repeat_instance_2 has 3903 (91.4%) missing values | Missing |
data_da_primeira_consulta_institucional_dt_pci_2 has 3903 (91.4%) missing values | Missing |
data_do_diagnostico_2 has 3903 (91.4%) missing values | Missing |
codigo_da_topografia_cid_o_2 has 3903 (91.4%) missing values | Missing |
codigo_da_morfologia_de_acordo_com_o_cid_o_2 has 3903 (91.4%) missing values | Missing |
estadio_clinico_2 has 3903 (91.4%) missing values | Missing |
grupo_de_estadio_clinico_1 has 195 (4.6%) missing values | Missing |
grupo_de_estadio_clinico_2 has 3959 (92.7%) missing values | Missing |
classificacao_tnm_clinico_t_2 has 3903 (91.4%) missing values | Missing |
classificacao_tnm_clinico_n_2 has 3903 (91.4%) missing values | Missing |
classificacao_tnm_clinico_m_2 has 3903 (91.4%) missing values | Missing |
metastase_ao_diagnostico_cid_o_1_1 has 3600 (84.3%) missing values | Missing |
metastase_ao_diagnostico_cid_o_1_2 has 4233 (99.1%) missing values | Missing |
metastase_ao_diagnostico_cid_o_2_1 has 3898 (91.2%) missing values | Missing |
metastase_ao_diagnostico_cid_o_2_2 has 4257 (99.6%) missing values | Missing |
metastase_ao_diagnostico_cid_o_3_1 has 4097 (95.9%) missing values | Missing |
metastase_ao_diagnostico_cid_o_3_2 has 4266 (99.9%) missing values | Missing |
metastase_ao_diagnostico_cid_o_4_1 has 4205 (98.4%) missing values | Missing |
metastase_ao_diagnostico_cid_o_4_2 has 4270 (> 99.9%) missing values | Missing |
data_do_tratamento_2 has 3928 (91.9%) missing values | Missing |
combinacao_dos_tratamentos_realizados_no_hospital_2 has 3903 (91.4%) missing values | Missing |
ano_do_diagnostico_2 has 3903 (91.4%) missing values | Missing |
lateralidade_do_tumor_2 has 3903 (91.4%) missing values | Missing |
data_de_recidiva_1 has 3023 (70.8%) missing values | Missing |
data_de_recidiva_2 has 4226 (98.9%) missing values | Missing |
tempo_desde_o_diagnostico_ate_a_primeira_recidiv_1 has 3023 (70.8%) missing values | Missing |
tempo_desde_o_diagnostico_ate_a_primeira_recidiv_2 has 4226 (98.9%) missing values | Missing |
local_de_recidiva_a_distancia_metastase_1_cid_o_topografia_1 has 3282 (76.8%) missing values | Missing |
local_de_recidiva_a_distancia_metastase_1_cid_o_topografia_2 has 4231 (99.0%) missing values | Missing |
local_de_recidiva_a_distancia_metastase_2_cid_o_topografia_1 has 3737 (87.5%) missing values | Missing |
local_de_recidiva_a_distancia_metastase_2_cid_o_topografia_2 has 4253 (99.6%) missing values | Missing |
local_de_recidiva_a_distancia_metastase_3_cid_o_topografia_1 has 4013 (93.9%) missing values | Missing |
local_de_recidiva_a_distancia_metastase_3_cid_o_topografia_2 has 4267 (99.9%) missing values | Missing |
local_de_recidiva_a_distancia_metastase_4_cid_o_topografia_1 has 4161 (97.4%) missing values | Missing |
local_de_recidiva_a_distancia_metastase_4_cid_o_topografia_2 has 4271 (> 99.9%) missing values | Missing |
descricao_da_morfologia_de_acordo_com_cid_o_2 has 3903 (91.4%) missing values | Missing |
descricao_da_topografia_2 has 3903 (91.4%) missing values | Missing |
classificacao_tnm_patologico_n_1 has 4086 (95.6%) missing values | Missing |
classificacao_tnm_patologico_n_2 has 4267 (99.9%) missing values | Missing |
classificacao_tnm_patologico_t_1 has 4085 (95.6%) missing values | Missing |
classificacao_tnm_patologico_t_2 has 4267 (99.9%) missing values | Missing |
com_recidiva_a_distancia_2 has 3903 (91.4%) missing values | Missing |
com_recidiva_regional_2 has 3903 (91.4%) missing values | Missing |
com_recidiva_local_2 has 3903 (91.4%) missing values | Missing |
data_da_primeira_consulta_institucional_dt_pci_1 is uniformly distributed | Uniform |
data_da_primeira_consulta_institucional_dt_pci_2 is uniformly distributed | Uniform |
data_do_diagnostico_1 is uniformly distributed | Uniform |
data_do_diagnostico_2 is uniformly distributed | Uniform |
metastase_ao_diagnostico_cid_o_4_2 is uniformly distributed | Uniform |
data_do_tratamento_1 is uniformly distributed | Uniform |
data_do_tratamento_2 is uniformly distributed | Uniform |
data_de_recidiva_1 is uniformly distributed | Uniform |
data_de_recidiva_2 is uniformly distributed | Uniform |
record_id has unique values | Unique |
Reproduction
| Analysis started | 2023-02-28 14:19:40.172842 |
|---|---|
| Analysis finished | 2023-02-28 14:20:21.324659 |
| Duration | 41.15 seconds |
| Software version | ydata-profiling vv4.0.0 |
| Download configuration | config.json |
record_id
Real number (ℝ)
| Distinct | 4272 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48652.36 |
| Minimum | 302 |
|---|---|
| Maximum | 82240 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 302 |
|---|---|
| 5-th percentile | 13992.4 |
| Q1 | 31013 |
| median | 53394 |
| Q3 | 65816.75 |
| 95-th percentile | 78668.25 |
| Maximum | 82240 |
| Range | 81938 |
| Interquartile range (IQR) | 34803.75 |
Descriptive statistics
| Standard deviation | 20659.52 |
|---|---|
| Coefficient of variation (CV) | 0.4246355 |
| Kurtosis | -0.99374558 |
| Mean | 48652.36 |
| Median Absolute Deviation (MAD) | 16732 |
| Skewness | -0.29501895 |
| Sum | 2.0784288 × 108 |
| Variance | 4.2681575 × 108 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 302 | 1 | < 0.1% |
| 60912 | 1 | < 0.1% |
| 60757 | 1 | < 0.1% |
| 60774 | 1 | < 0.1% |
| 60777 | 1 | < 0.1% |
| 60799 | 1 | < 0.1% |
| 60815 | 1 | < 0.1% |
| 60825 | 1 | < 0.1% |
| 60826 | 1 | < 0.1% |
| 60840 | 1 | < 0.1% |
| Other values (4262) | 4262 |
| Value | Count | Frequency (%) |
| 302 | 1 | |
| 710 | 1 | |
| 752 | 1 | |
| 1367 | 1 | |
| 1589 | 1 | |
| 1705 | 1 | |
| 1843 | 1 | |
| 1873 | 1 | |
| 1898 | 1 | |
| 1960 | 1 |
| Value | Count | Frequency (%) |
| 82240 | 1 | |
| 82205 | 1 | |
| 82131 | 1 | |
| 82124 | 1 | |
| 82123 | 1 | |
| 82122 | 1 | |
| 82118 | 1 | |
| 82112 | 1 | |
| 82111 | 1 | |
| 82100 | 1 |
repeat_instrument_1
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Registro De Tumores |
|---|
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 81168 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Registro De Tumores |
|---|---|
| 2nd row | Registro De Tumores |
| 3rd row | Registro De Tumores |
| 4th row | Registro De Tumores |
| 5th row | Registro De Tumores |
Common Values
| Value | Count | Frequency (%) |
| Registro De Tumores | 4272 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| registro | 4272 | |
| de | 4272 | |
| tumores | 4272 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 12816 | |
| s | 8544 | |
| r | 8544 | |
| o | 8544 | |
| 8544 | ||
| R | 4272 | 5.3% |
| g | 4272 | 5.3% |
| i | 4272 | 5.3% |
| t | 4272 | 5.3% |
| D | 4272 | 5.3% |
| Other values (3) | 12816 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 59808 | |
| Uppercase Letter | 12816 | 15.8% |
| Space Separator | 8544 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 12816 | |
| s | 8544 | |
| r | 8544 | |
| o | 8544 | |
| g | 4272 | 7.1% |
| i | 4272 | 7.1% |
| t | 4272 | 7.1% |
| u | 4272 | 7.1% |
| m | 4272 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 4272 | |
| D | 4272 | |
| T | 4272 |
Space Separator
| Value | Count | Frequency (%) |
| 8544 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 72624 | |
| Common | 8544 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 12816 | |
| s | 8544 | |
| r | 8544 | |
| o | 8544 | |
| R | 4272 | 5.9% |
| g | 4272 | 5.9% |
| i | 4272 | 5.9% |
| t | 4272 | 5.9% |
| D | 4272 | 5.9% |
| T | 4272 | 5.9% |
| Other values (2) | 8544 |
Common
| Value | Count | Frequency (%) |
| 8544 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 81168 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 12816 | |
| s | 8544 | |
| r | 8544 | |
| o | 8544 | |
| 8544 | ||
| R | 4272 | 5.3% |
| g | 4272 | 5.3% |
| i | 4272 | 5.3% |
| t | 4272 | 5.3% |
| D | 4272 | 5.3% |
| Other values (3) | 12816 |
repeat_instrument_2
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| Registro De Tumores |
|---|
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 7011 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Registro De Tumores |
|---|---|
| 2nd row | Registro De Tumores |
| 3rd row | Registro De Tumores |
| 4th row | Registro De Tumores |
| 5th row | Registro De Tumores |
Common Values
| Value | Count | Frequency (%) |
| Registro De Tumores | 369 | 8.6% |
| (Missing) | 3903 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| registro | 369 | |
| de | 369 | |
| tumores | 369 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1107 | |
| s | 738 | |
| r | 738 | |
| o | 738 | |
| 738 | ||
| R | 369 | 5.3% |
| g | 369 | 5.3% |
| i | 369 | 5.3% |
| t | 369 | 5.3% |
| D | 369 | 5.3% |
| Other values (3) | 1107 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5166 | |
| Uppercase Letter | 1107 | 15.8% |
| Space Separator | 738 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1107 | |
| s | 738 | |
| r | 738 | |
| o | 738 | |
| g | 369 | 7.1% |
| i | 369 | 7.1% |
| t | 369 | 7.1% |
| u | 369 | 7.1% |
| m | 369 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 369 | |
| D | 369 | |
| T | 369 |
Space Separator
| Value | Count | Frequency (%) |
| 738 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6273 | |
| Common | 738 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1107 | |
| s | 738 | |
| r | 738 | |
| o | 738 | |
| R | 369 | 5.9% |
| g | 369 | 5.9% |
| i | 369 | 5.9% |
| t | 369 | 5.9% |
| D | 369 | 5.9% |
| T | 369 | 5.9% |
| Other values (2) | 738 |
Common
| Value | Count | Frequency (%) |
| 738 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7011 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1107 | |
| s | 738 | |
| r | 738 | |
| o | 738 | |
| 738 | ||
| R | 369 | 5.3% |
| g | 369 | 5.3% |
| i | 369 | 5.3% |
| t | 369 | 5.3% |
| D | 369 | 5.3% |
| Other values (3) | 1107 |
repeat_instance_1
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 12816 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 4272 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 4272 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4272 | |
| . | 4272 | |
| 0 | 4272 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8544 | |
| Other Punctuation | 4272 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4272 | |
| 0 | 4272 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12816 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 4272 | |
| . | 4272 | |
| 0 | 4272 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12816 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 4272 | |
| . | 4272 | |
| 0 | 4272 |
repeat_instance_2
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| 2.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1107 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 2.0 |
| 4th row | 2.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 369 | 8.6% |
| (Missing) | 3903 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2.0 | 369 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 369 | |
| . | 369 | |
| 0 | 369 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 738 | |
| Other Punctuation | 369 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 369 | |
| 0 | 369 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 369 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1107 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 369 | |
| . | 369 | |
| 0 | 369 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1107 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 369 | |
| . | 369 | |
| 0 | 369 |
data_da_primeira_consulta_institucional_dt_pci_1
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 2463 |
|---|---|
| Distinct (%) | 57.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| 2011-08-14 | 8 |
|---|---|
| 2017-05-12 | 7 |
| 2017-04-21 | 6 |
| 2018-01-02 | 6 |
| 2016-04-12 | 6 |
| Other values (2458) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 42720 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1328 ? |
|---|---|
| Unique (%) | 31.1% |
Sample
| 1st row | 2008-03-22 |
|---|---|
| 2nd row | 2006-11-11 |
| 3rd row | 2007-09-25 |
| 4th row | 2008-02-03 |
| 5th row | 2008-05-15 |
Common Values
| Value | Count | Frequency (%) |
| 2011-08-14 | 8 | 0.2% |
| 2017-05-12 | 7 | 0.2% |
| 2017-04-21 | 6 | 0.1% |
| 2018-01-02 | 6 | 0.1% |
| 2016-04-12 | 6 | 0.1% |
| 2014-11-15 | 6 | 0.1% |
| 2017-07-30 | 6 | 0.1% |
| 2017-06-10 | 6 | 0.1% |
| 2015-09-30 | 6 | 0.1% |
| 2016-06-09 | 6 | 0.1% |
| Other values (2453) | 4209 |
Length
| Value | Count | Frequency (%) |
| 2011-08-14 | 8 | 0.2% |
| 2017-05-12 | 7 | 0.2% |
| 2015-09-30 | 6 | 0.1% |
| 2017-11-07 | 6 | 0.1% |
| 2016-09-01 | 6 | 0.1% |
| 2016-02-18 | 6 | 0.1% |
| 2015-08-29 | 6 | 0.1% |
| 2016-06-09 | 6 | 0.1% |
| 2016-03-26 | 6 | 0.1% |
| 2017-06-10 | 6 | 0.1% |
| Other values (2453) | 4209 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 10005 | |
| - | 8544 | |
| 1 | 8206 | |
| 2 | 7289 | |
| 5 | 1470 | 3.4% |
| 6 | 1441 | 3.4% |
| 7 | 1416 | 3.3% |
| 3 | 1381 | 3.2% |
| 4 | 1104 | 2.6% |
| 8 | 1003 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 34176 | |
| Dash Punctuation | 8544 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10005 | |
| 1 | 8206 | |
| 2 | 7289 | |
| 5 | 1470 | 4.3% |
| 6 | 1441 | 4.2% |
| 7 | 1416 | 4.1% |
| 3 | 1381 | 4.0% |
| 4 | 1104 | 3.2% |
| 8 | 1003 | 2.9% |
| 9 | 861 | 2.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8544 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42720 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 10005 | |
| - | 8544 | |
| 1 | 8206 | |
| 2 | 7289 | |
| 5 | 1470 | 3.4% |
| 6 | 1441 | 3.4% |
| 7 | 1416 | 3.3% |
| 3 | 1381 | 3.2% |
| 4 | 1104 | 2.6% |
| 8 | 1003 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42720 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 10005 | |
| - | 8544 | |
| 1 | 8206 | |
| 2 | 7289 | |
| 5 | 1470 | 3.4% |
| 6 | 1441 | 3.4% |
| 7 | 1416 | 3.3% |
| 3 | 1381 | 3.2% |
| 4 | 1104 | 2.6% |
| 8 | 1003 | 2.3% |
data_da_primeira_consulta_institucional_dt_pci_2
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 349 |
|---|---|
| Distinct (%) | 94.6% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| 2017-08-18 | 3 |
|---|---|
| 2017-10-03 | 2 |
| 2017-07-24 | 2 |
| 2016-04-24 | 2 |
| 2017-10-20 | 2 |
| Other values (344) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 3690 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 330 ? |
|---|---|
| Unique (%) | 89.4% |
Sample
| 1st row | 2014-05-12 |
|---|---|
| 2nd row | 2009-06-29 |
| 3rd row | 2016-04-11 |
| 4th row | 2010-07-24 |
| 5th row | 2007-10-17 |
Common Values
| Value | Count | Frequency (%) |
| 2017-08-18 | 3 | 0.1% |
| 2017-10-03 | 2 | < 0.1% |
| 2017-07-24 | 2 | < 0.1% |
| 2016-04-24 | 2 | < 0.1% |
| 2017-10-20 | 2 | < 0.1% |
| 2016-12-25 | 2 | < 0.1% |
| 2018-02-07 | 2 | < 0.1% |
| 2016-03-26 | 2 | < 0.1% |
| 2016-08-27 | 2 | < 0.1% |
| 2016-09-28 | 2 | < 0.1% |
| Other values (339) | 348 | 8.1% |
| (Missing) | 3903 |
Length
| Value | Count | Frequency (%) |
| 2017-08-18 | 3 | 0.8% |
| 2016-04-22 | 2 | 0.5% |
| 2017-10-03 | 2 | 0.5% |
| 2018-06-01 | 2 | 0.5% |
| 2014-05-12 | 2 | 0.5% |
| 2016-01-26 | 2 | 0.5% |
| 2016-04-11 | 2 | 0.5% |
| 2018-04-16 | 2 | 0.5% |
| 2016-04-15 | 2 | 0.5% |
| 2017-12-18 | 2 | 0.5% |
| Other values (339) | 348 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 841 | |
| - | 738 | |
| 1 | 682 | |
| 2 | 624 | |
| 7 | 150 | 4.1% |
| 6 | 133 | 3.6% |
| 8 | 116 | 3.1% |
| 4 | 114 | 3.1% |
| 3 | 110 | 3.0% |
| 5 | 99 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2952 | |
| Dash Punctuation | 738 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 841 | |
| 1 | 682 | |
| 2 | 624 | |
| 7 | 150 | 5.1% |
| 6 | 133 | 4.5% |
| 8 | 116 | 3.9% |
| 4 | 114 | 3.9% |
| 3 | 110 | 3.7% |
| 5 | 99 | 3.4% |
| 9 | 83 | 2.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 738 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3690 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 841 | |
| - | 738 | |
| 1 | 682 | |
| 2 | 624 | |
| 7 | 150 | 4.1% |
| 6 | 133 | 3.6% |
| 8 | 116 | 3.1% |
| 4 | 114 | 3.1% |
| 3 | 110 | 3.0% |
| 5 | 99 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3690 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 841 | |
| - | 738 | |
| 1 | 682 | |
| 2 | 624 | |
| 7 | 150 | 4.1% |
| 6 | 133 | 3.6% |
| 8 | 116 | 3.1% |
| 4 | 114 | 3.1% |
| 3 | 110 | 3.0% |
| 5 | 99 | 2.7% |
data_do_diagnostico_1
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 2460 |
|---|---|
| Distinct (%) | 57.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| 2011-02-01 | 7 |
|---|---|
| 2012-03-17 | 7 |
| 2015-08-02 | 7 |
| 2016-05-27 | 7 |
| 2015-08-13 | 6 |
| Other values (2455) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 42720 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1312 ? |
|---|---|
| Unique (%) | 30.7% |
Sample
| 1st row | 2008-03-23 |
|---|---|
| 2nd row | 2007-11-11 |
| 3rd row | 2007-12-18 |
| 4th row | 2008-02-06 |
| 5th row | 2008-05-21 |
Common Values
| Value | Count | Frequency (%) |
| 2011-02-01 | 7 | 0.2% |
| 2012-03-17 | 7 | 0.2% |
| 2015-08-02 | 7 | 0.2% |
| 2016-05-27 | 7 | 0.2% |
| 2015-08-13 | 6 | 0.1% |
| 2015-07-06 | 6 | 0.1% |
| 2015-06-17 | 6 | 0.1% |
| 2012-03-29 | 6 | 0.1% |
| 2015-06-10 | 6 | 0.1% |
| 2016-07-23 | 6 | 0.1% |
| Other values (2450) | 4208 |
Length
| Value | Count | Frequency (%) |
| 2011-02-01 | 7 | 0.2% |
| 2015-08-02 | 7 | 0.2% |
| 2016-05-27 | 7 | 0.2% |
| 2012-03-17 | 7 | 0.2% |
| 2015-08-13 | 6 | 0.1% |
| 2015-07-06 | 6 | 0.1% |
| 2015-06-17 | 6 | 0.1% |
| 2012-03-29 | 6 | 0.1% |
| 2015-06-10 | 6 | 0.1% |
| 2016-07-23 | 6 | 0.1% |
| Other values (2450) | 4208 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 10021 | |
| - | 8544 | |
| 1 | 8145 | |
| 2 | 7255 | |
| 6 | 1443 | 3.4% |
| 5 | 1440 | 3.4% |
| 3 | 1426 | 3.3% |
| 7 | 1398 | 3.3% |
| 4 | 1116 | 2.6% |
| 8 | 968 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 34176 | |
| Dash Punctuation | 8544 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10021 | |
| 1 | 8145 | |
| 2 | 7255 | |
| 6 | 1443 | 4.2% |
| 5 | 1440 | 4.2% |
| 3 | 1426 | 4.2% |
| 7 | 1398 | 4.1% |
| 4 | 1116 | 3.3% |
| 8 | 968 | 2.8% |
| 9 | 964 | 2.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8544 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42720 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 10021 | |
| - | 8544 | |
| 1 | 8145 | |
| 2 | 7255 | |
| 6 | 1443 | 3.4% |
| 5 | 1440 | 3.4% |
| 3 | 1426 | 3.3% |
| 7 | 1398 | 3.3% |
| 4 | 1116 | 2.6% |
| 8 | 968 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42720 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 10021 | |
| - | 8544 | |
| 1 | 8145 | |
| 2 | 7255 | |
| 6 | 1443 | 3.4% |
| 5 | 1440 | 3.4% |
| 3 | 1426 | 3.3% |
| 7 | 1398 | 3.3% |
| 4 | 1116 | 2.6% |
| 8 | 968 | 2.3% |
data_do_diagnostico_2
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 358 |
|---|---|
| Distinct (%) | 97.0% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| 2017-12-17 | 2 |
|---|---|
| 2017-06-10 | 2 |
| 2018-07-28 | 2 |
| 2017-07-06 | 2 |
| 2014-01-28 | 2 |
| Other values (353) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 3690 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 347 ? |
|---|---|
| Unique (%) | 94.0% |
Sample
| 1st row | 2014-05-15 |
|---|---|
| 2nd row | 2009-08-24 |
| 3rd row | 2016-05-04 |
| 4th row | 2008-07-30 |
| 5th row | 2007-12-06 |
Common Values
| Value | Count | Frequency (%) |
| 2017-12-17 | 2 | < 0.1% |
| 2017-06-10 | 2 | < 0.1% |
| 2018-07-28 | 2 | < 0.1% |
| 2017-07-06 | 2 | < 0.1% |
| 2014-01-28 | 2 | < 0.1% |
| 2020-01-09 | 2 | < 0.1% |
| 2010-12-10 | 2 | < 0.1% |
| 2016-02-17 | 2 | < 0.1% |
| 2008-11-19 | 2 | < 0.1% |
| 2013-02-23 | 2 | < 0.1% |
| Other values (348) | 349 | 8.2% |
| (Missing) | 3903 |
Length
| Value | Count | Frequency (%) |
| 2017-12-17 | 2 | 0.5% |
| 2010-12-10 | 2 | 0.5% |
| 2017-06-10 | 2 | 0.5% |
| 2013-02-23 | 2 | 0.5% |
| 2008-11-19 | 2 | 0.5% |
| 2016-02-17 | 2 | 0.5% |
| 2010-09-21 | 2 | 0.5% |
| 2020-01-09 | 2 | 0.5% |
| 2014-01-28 | 2 | 0.5% |
| 2017-07-06 | 2 | 0.5% |
| Other values (348) | 349 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 848 | |
| - | 738 | |
| 1 | 686 | |
| 2 | 613 | |
| 7 | 146 | 4.0% |
| 6 | 128 | 3.5% |
| 5 | 119 | 3.2% |
| 3 | 114 | 3.1% |
| 4 | 106 | 2.9% |
| 9 | 101 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2952 | |
| Dash Punctuation | 738 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 848 | |
| 1 | 686 | |
| 2 | 613 | |
| 7 | 146 | 4.9% |
| 6 | 128 | 4.3% |
| 5 | 119 | 4.0% |
| 3 | 114 | 3.9% |
| 4 | 106 | 3.6% |
| 9 | 101 | 3.4% |
| 8 | 91 | 3.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 738 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3690 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 848 | |
| - | 738 | |
| 1 | 686 | |
| 2 | 613 | |
| 7 | 146 | 4.0% |
| 6 | 128 | 3.5% |
| 5 | 119 | 3.2% |
| 3 | 114 | 3.1% |
| 4 | 106 | 2.9% |
| 9 | 101 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3690 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 848 | |
| - | 738 | |
| 1 | 686 | |
| 2 | 613 | |
| 7 | 146 | 4.0% |
| 6 | 128 | 3.5% |
| 5 | 119 | 3.2% |
| 3 | 114 | 3.1% |
| 4 | 106 | 2.9% |
| 9 | 101 | 2.7% |
codigo_da_topografia_cid_o_1
Categorical
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| C509 | |
|---|---|
| C504 | |
| C502 | |
| C505 | |
| C508 | 175 |
| Other values (8) |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 17088 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | C504 |
|---|---|
| 2nd row | C508 |
| 3rd row | C509 |
| 4th row | C505 |
| 5th row | C508 |
Common Values
| Value | Count | Frequency (%) |
| C509 | 1927 | |
| C504 | 1155 | |
| C502 | 282 | 6.6% |
| C505 | 247 | 5.8% |
| C508 | 175 | 4.1% |
| C503 | 174 | 4.1% |
| C500 | 168 | 3.9% |
| C501 | 131 | 3.1% |
| C506 | 9 | 0.2% |
| C049 | 1 | < 0.1% |
| Other values (3) | 3 | 0.1% |
Length
| Value | Count | Frequency (%) |
| c509 | 1927 | |
| c504 | 1155 | |
| c502 | 282 | 6.6% |
| c505 | 247 | 5.8% |
| c508 | 175 | 4.1% |
| c503 | 174 | 4.1% |
| c500 | 168 | 3.9% |
| c501 | 131 | 3.1% |
| c506 | 9 | 0.2% |
| c049 | 1 | < 0.1% |
| Other values (3) | 3 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 4516 | |
| 0 | 4439 | |
| C | 4272 | |
| 9 | 1930 | |
| 4 | 1156 | 6.8% |
| 2 | 283 | 1.7% |
| 8 | 176 | 1.0% |
| 3 | 175 | 1.0% |
| 1 | 132 | 0.8% |
| 6 | 9 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12816 | |
| Uppercase Letter | 4272 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 4516 | |
| 0 | 4439 | |
| 9 | 1930 | |
| 4 | 1156 | 9.0% |
| 2 | 283 | 2.2% |
| 8 | 176 | 1.4% |
| 3 | 175 | 1.4% |
| 1 | 132 | 1.0% |
| 6 | 9 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 4272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12816 | |
| Latin | 4272 | 25.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 4516 | |
| 0 | 4439 | |
| 9 | 1930 | |
| 4 | 1156 | 9.0% |
| 2 | 283 | 2.2% |
| 8 | 176 | 1.4% |
| 3 | 175 | 1.4% |
| 1 | 132 | 1.0% |
| 6 | 9 | 0.1% |
Latin
| Value | Count | Frequency (%) |
| C | 4272 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17088 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 4516 | |
| 0 | 4439 | |
| C | 4272 | |
| 9 | 1930 | |
| 4 | 1156 | 6.8% |
| 2 | 283 | 1.7% |
| 8 | 176 | 1.0% |
| 3 | 175 | 1.0% |
| 1 | 132 | 0.8% |
| 6 | 9 | 0.1% |
codigo_da_topografia_cid_o_2
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 72 |
|---|---|
| Distinct (%) | 19.5% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| C509 | |
|---|---|
| C504 | |
| C739 | 17 |
| C502 | 15 |
| C446 | 12 |
| Other values (67) |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1476 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 34 ? |
|---|---|
| Unique (%) | 9.2% |
Sample
| 1st row | C539 |
|---|---|
| 2nd row | C186 |
| 3rd row | C509 |
| 4th row | C445 |
| 5th row | C447 |
Common Values
| Value | Count | Frequency (%) |
| C509 | 87 | 2.0% |
| C504 | 42 | 1.0% |
| C739 | 17 | 0.4% |
| C502 | 15 | 0.4% |
| C446 | 12 | 0.3% |
| C649 | 12 | 0.3% |
| C503 | 12 | 0.3% |
| C508 | 11 | 0.3% |
| C505 | 10 | 0.2% |
| C443 | 9 | 0.2% |
| Other values (62) | 142 | 3.3% |
| (Missing) | 3903 |
Length
| Value | Count | Frequency (%) |
| c509 | 87 | |
| c504 | 42 | 11.4% |
| c739 | 17 | 4.6% |
| c502 | 15 | 4.1% |
| c446 | 12 | 3.3% |
| c649 | 12 | 3.3% |
| c503 | 12 | 3.3% |
| c508 | 11 | 3.0% |
| c505 | 10 | 2.7% |
| c443 | 9 | 2.4% |
| Other values (62) | 142 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 369 | |
| 5 | 234 | |
| 0 | 222 | |
| 4 | 178 | |
| 9 | 155 | |
| 3 | 78 | 5.3% |
| 1 | 71 | 4.8% |
| 6 | 54 | 3.7% |
| 2 | 52 | 3.5% |
| 7 | 36 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1107 | |
| Uppercase Letter | 369 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 234 | |
| 0 | 222 | |
| 4 | 178 | |
| 9 | 155 | |
| 3 | 78 | 7.0% |
| 1 | 71 | 6.4% |
| 6 | 54 | 4.9% |
| 2 | 52 | 4.7% |
| 7 | 36 | 3.3% |
| 8 | 27 | 2.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 369 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1107 | |
| Latin | 369 | 25.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 234 | |
| 0 | 222 | |
| 4 | 178 | |
| 9 | 155 | |
| 3 | 78 | 7.0% |
| 1 | 71 | 6.4% |
| 6 | 54 | 4.9% |
| 2 | 52 | 4.7% |
| 7 | 36 | 3.3% |
| 8 | 27 | 2.4% |
Latin
| Value | Count | Frequency (%) |
| C | 369 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1476 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 369 | |
| 5 | 234 | |
| 0 | 222 | |
| 4 | 178 | |
| 9 | 155 | |
| 3 | 78 | 5.3% |
| 1 | 71 | 4.8% |
| 6 | 54 | 3.7% |
| 2 | 52 | 3.5% |
| 7 | 36 | 2.4% |
codigo_da_morfologia_de_acordo_com_o_cid_o_1
Real number (ℝ)
| Distinct | 43 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84952.631 |
| Minimum | 80103 |
|---|---|
| Maximum | 97553 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 80103 |
|---|---|
| 5-th percentile | 85003 |
| Q1 | 85003 |
| median | 85003 |
| Q3 | 85003 |
| 95-th percentile | 85203 |
| Maximum | 97553 |
| Range | 17450 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 657.00693 |
|---|---|
| Coefficient of variation (CV) | 0.0077338032 |
| Kurtosis | 68.057451 |
| Mean | 84952.631 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.4878973 |
| Sum | 3.6291764 × 108 |
| Variance | 431658.11 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 85003 | 3793 | |
| 85203 | 140 | 3.3% |
| 84803 | 49 | 1.1% |
| 85753 | 46 | 1.1% |
| 80503 | 38 | 0.9% |
| 85002 | 30 | 0.7% |
| 85033 | 19 | 0.4% |
| 85503 | 19 | 0.4% |
| 85233 | 18 | 0.4% |
| 85223 | 14 | 0.3% |
| Other values (33) | 106 | 2.5% |
| Value | Count | Frequency (%) |
| 80103 | 9 | 0.2% |
| 80203 | 1 | < 0.1% |
| 80223 | 1 | < 0.1% |
| 80333 | 1 | < 0.1% |
| 80502 | 3 | 0.1% |
| 80503 | 38 | |
| 80703 | 5 | 0.1% |
| 80713 | 1 | < 0.1% |
| 81403 | 4 | 0.1% |
| 82003 | 8 | 0.2% |
| Value | Count | Frequency (%) |
| 97553 | 1 | < 0.1% |
| 90203 | 5 | 0.1% |
| 89803 | 1 | < 0.1% |
| 88903 | 1 | < 0.1% |
| 88323 | 1 | < 0.1% |
| 88003 | 2 | < 0.1% |
| 85753 | 46 | |
| 85503 | 19 | |
| 85433 | 2 | < 0.1% |
| 85413 | 8 | 0.2% |
codigo_da_morfologia_de_acordo_com_o_cid_o_2
Real number (ℝ)
| Distinct | 73 |
|---|---|
| Distinct (%) | 19.8% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84178.667 |
| Minimum | 80103 |
|---|---|
| Maximum | 99873 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 80103 |
|---|---|
| 5-th percentile | 80703 |
| Q1 | 82113 |
| median | 85002 |
| Q3 | 85003 |
| 95-th percentile | 87785 |
| Maximum | 99873 |
| Range | 19770 |
| Interquartile range (IQR) | 2890 |
Descriptive statistics
| Standard deviation | 2990.1006 |
|---|---|
| Coefficient of variation (CV) | 0.035520884 |
| Kurtosis | 9.1598417 |
| Mean | 84178.667 |
| Median Absolute Deviation (MAD) | 501 |
| Skewness | 2.3002379 |
| Sum | 31061928 |
| Variance | 8940701.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 85003 | 111 | 2.6% |
| 85002 | 43 | 1.0% |
| 80703 | 25 | 0.6% |
| 81403 | 23 | 0.5% |
| 80973 | 13 | 0.3% |
| 85203 | 11 | 0.3% |
| 83123 | 11 | 0.3% |
| 82113 | 10 | 0.2% |
| 82603 | 9 | 0.2% |
| 85503 | 8 | 0.2% |
| Other values (63) | 105 | 2.5% |
| (Missing) | 3903 |
| Value | Count | Frequency (%) |
| 80103 | 2 | < 0.1% |
| 80413 | 2 | < 0.1% |
| 80463 | 1 | < 0.1% |
| 80503 | 3 | 0.1% |
| 80702 | 4 | 0.1% |
| 80703 | 25 | |
| 80713 | 1 | < 0.1% |
| 80762 | 1 | < 0.1% |
| 80772 | 1 | < 0.1% |
| 80812 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 99873 | 1 | |
| 99203 | 1 | |
| 98663 | 2 | |
| 97323 | 1 | |
| 96993 | 1 | |
| 96803 | 2 | |
| 95403 | 1 | |
| 93913 | 1 | |
| 91813 | 1 | |
| 91203 | 1 |
estadio_clinico_1
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| IIA | |
|---|---|
| IIIA | |
| IIB | |
| IV | |
| IIIB | |
| Other values (9) |
Length
| Max length | 30 |
|---|---|
| Median length | 5 |
| Mean length | 2.9824438 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12741 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | IIA |
|---|---|
| 2nd row | IIIA |
| 3rd row | IIA |
| 4th row | IIA |
| 5th row | IIB |
Common Values
| Value | Count | Frequency (%) |
| IIA | 911 | |
| IIIA | 722 | |
| IIB | 715 | |
| IV | 544 | |
| IIIB | 460 | |
| IA | 408 | |
| I | 240 | 5.6% |
| IIIC | 183 | 4.3% |
| 0 | 37 | 0.9% |
| IB | 36 | 0.8% |
| Other values (4) | 16 | 0.4% |
Length
| Value | Count | Frequency (%) |
| iia | 911 | |
| iiia | 722 | |
| iib | 715 | |
| iv | 544 | |
| iiib | 460 | |
| ia | 408 | |
| i | 240 | 5.6% |
| iiic | 183 | 4.3% |
| 0 | 37 | 0.9% |
| ib | 36 | 0.8% |
| Other values (9) | 39 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 8578 | |
| A | 2052 | 16.1% |
| B | 1212 | 9.5% |
| V | 545 | 4.3% |
| C | 183 | 1.4% |
| 0 | 37 | 0.3% |
| 23 | 0.2% | |
| : | 14 | 0.1% |
| Y | 11 | 0.1% |
| N | 11 | 0.1% |
| Other values (17) | 75 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12595 | |
| Lowercase Letter | 72 | 0.6% |
| Decimal Number | 37 | 0.3% |
| Space Separator | 23 | 0.2% |
| Other Punctuation | 14 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9 | |
| o | 9 | |
| n | 6 | 8.3% |
| i | 6 | 8.3% |
| s | 6 | 8.3% |
| r | 6 | 8.3% |
| ã | 3 | 4.2% |
| f | 3 | 4.2% |
| p | 3 | 4.2% |
| Ã | 3 | 4.2% |
| Other values (6) | 18 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 8578 | |
| A | 2052 | 16.3% |
| B | 1212 | 9.6% |
| V | 545 | 4.3% |
| C | 183 | 1.5% |
| Y | 11 | 0.1% |
| N | 11 | 0.1% |
| X | 3 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 37 |
Space Separator
| Value | Count | Frequency (%) |
| 23 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12667 | |
| Common | 74 | 0.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 8578 | |
| A | 2052 | 16.2% |
| B | 1212 | 9.6% |
| V | 545 | 4.3% |
| C | 183 | 1.4% |
| Y | 11 | 0.1% |
| N | 11 | 0.1% |
| e | 9 | 0.1% |
| o | 9 | 0.1% |
| n | 6 | < 0.1% |
| Other values (14) | 51 | 0.4% |
Common
| Value | Count | Frequency (%) |
| 0 | 37 | |
| 23 | ||
| : | 14 | 18.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12735 | |
| None | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 8578 | |
| A | 2052 | 16.1% |
| B | 1212 | 9.5% |
| V | 545 | 4.3% |
| C | 183 | 1.4% |
| 0 | 37 | 0.3% |
| 23 | 0.2% | |
| : | 14 | 0.1% |
| Y | 11 | 0.1% |
| N | 11 | 0.1% |
| Other values (15) | 69 | 0.5% |
None
| Value | Count | Frequency (%) |
| ã | 3 | |
| Ã | 3 |
estadio_clinico_2
Categorical
| Distinct | 21 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| 0 | |
|---|---|
| IA | |
| I | |
| IIA | |
| IV | |
| Other values (16) |
Length
| Max length | 30 |
|---|---|
| Median length | 5 |
| Mean length | 2.4634146 |
| Min length | 1 |
Characters and Unicode
| Total characters | 909 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | IVB |
|---|---|
| 2nd row | IV |
| 3rd row | IIA |
| 4th row | IIA |
| 5th row | IA |
Common Values
| Value | Count | Frequency (%) |
| 0 | 60 | 1.4% |
| IA | 59 | 1.4% |
| I | 55 | 1.3% |
| IIA | 45 | 1.1% |
| IV | 32 | 0.7% |
| Y: NA | 16 | 0.4% |
| IIIB | 16 | 0.4% |
| IIB | 15 | 0.4% |
| III | 15 | 0.4% |
| II | 12 | 0.3% |
| Other values (11) | 44 | 1.0% |
| (Missing) | 3903 |
Length
| Value | Count | Frequency (%) |
| 0 | 60 | |
| ia | 59 | |
| i | 55 | |
| iia | 45 | |
| iv | 32 | |
| y | 16 | 4.0% |
| na | 16 | 4.0% |
| iiib | 16 | 4.0% |
| iib | 15 | 3.8% |
| iii | 15 | 3.8% |
| Other values (16) | 68 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 452 | |
| A | 136 | 15.0% |
| 0 | 61 | 6.7% |
| V | 48 | 5.3% |
| B | 48 | 5.3% |
| 28 | 3.1% | |
| : | 19 | 2.1% |
| Y | 16 | 1.8% |
| N | 16 | 1.8% |
| o | 9 | 1.0% |
| Other values (19) | 76 | 8.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 727 | |
| Lowercase Letter | 72 | 7.9% |
| Decimal Number | 63 | 6.9% |
| Space Separator | 28 | 3.1% |
| Other Punctuation | 19 | 2.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 9 | |
| e | 9 | |
| n | 6 | 8.3% |
| i | 6 | 8.3% |
| r | 6 | 8.3% |
| s | 6 | 8.3% |
| m | 3 | 4.2% |
| l | 3 | 4.2% |
| t | 3 | 4.2% |
| d | 3 | 4.2% |
| Other values (6) | 18 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 452 | |
| A | 136 | 18.7% |
| V | 48 | 6.6% |
| B | 48 | 6.6% |
| Y | 16 | 2.2% |
| N | 16 | 2.2% |
| C | 8 | 1.1% |
| X | 3 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 61 | |
| 1 | 1 | 1.6% |
| 2 | 1 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 28 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 799 | |
| Common | 110 | 12.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 452 | |
| A | 136 | 17.0% |
| V | 48 | 6.0% |
| B | 48 | 6.0% |
| Y | 16 | 2.0% |
| N | 16 | 2.0% |
| o | 9 | 1.1% |
| e | 9 | 1.1% |
| C | 8 | 1.0% |
| n | 6 | 0.8% |
| Other values (14) | 51 | 6.4% |
Common
| Value | Count | Frequency (%) |
| 0 | 61 | |
| 28 | ||
| : | 19 | 17.3% |
| 1 | 1 | 0.9% |
| 2 | 1 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 903 | |
| None | 6 | 0.7% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 452 | |
| A | 136 | 15.1% |
| 0 | 61 | 6.8% |
| V | 48 | 5.3% |
| B | 48 | 5.3% |
| 28 | 3.1% | |
| : | 19 | 2.1% |
| Y | 16 | 1.8% |
| N | 16 | 1.8% |
| o | 9 | 1.0% |
| Other values (17) | 70 | 7.8% |
None
| Value | Count | Frequency (%) |
| Ã | 3 | |
| ã | 3 |
grupo_de_estadio_clinico_1
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 195 |
| Missing (%) | 4.6% |
| Memory size | 33.5 KiB |
| II | |
|---|---|
| III | |
| I | |
| IV | |
| 0 | 28 |
| Other values (2) | 12 |
Length
| Max length | 31 |
|---|---|
| Median length | 2 |
| Mean length | 2.1751288 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8868 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | II |
|---|---|
| 2nd row | III |
| 3rd row | II |
| 4th row | II |
| 5th row | II |
Common Values
| Value | Count | Frequency (%) |
| II | 1570 | |
| III | 1291 | |
| I | 663 | |
| IV | 513 | 12.0% |
| 0 | 28 | 0.7% |
| Y: Na | 9 | 0.2% |
| X - nao foi possivel determinar | 3 | 0.1% |
| (Missing) | 195 | 4.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ii | 1570 | |
| iii | 1291 | |
| i | 663 | |
| iv | 513 | 12.5% |
| 0 | 28 | 0.7% |
| y | 9 | 0.2% |
| na | 9 | 0.2% |
| x | 3 | 0.1% |
| 3 | 0.1% | |
| nao | 3 | 0.1% |
| Other values (3) | 9 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 8189 | |
| V | 513 | 5.8% |
| 0 | 28 | 0.3% |
| 24 | 0.3% | |
| a | 15 | 0.2% |
| e | 9 | 0.1% |
| i | 9 | 0.1% |
| o | 9 | 0.1% |
| N | 9 | 0.1% |
| : | 9 | 0.1% |
| Other values (13) | 54 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8723 | |
| Lowercase Letter | 81 | 0.9% |
| Decimal Number | 28 | 0.3% |
| Space Separator | 24 | 0.3% |
| Other Punctuation | 9 | 0.1% |
| Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 15 | |
| e | 9 | |
| i | 9 | |
| o | 9 | |
| n | 6 | 7.4% |
| s | 6 | 7.4% |
| r | 6 | 7.4% |
| f | 3 | 3.7% |
| p | 3 | 3.7% |
| v | 3 | 3.7% |
| Other values (4) | 12 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 8189 | |
| V | 513 | 5.9% |
| N | 9 | 0.1% |
| Y | 9 | 0.1% |
| X | 3 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 28 |
Space Separator
| Value | Count | Frequency (%) |
| 24 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 9 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8804 | |
| Common | 64 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 8189 | |
| V | 513 | 5.8% |
| a | 15 | 0.2% |
| e | 9 | 0.1% |
| i | 9 | 0.1% |
| o | 9 | 0.1% |
| N | 9 | 0.1% |
| Y | 9 | 0.1% |
| n | 6 | 0.1% |
| s | 6 | 0.1% |
| Other values (9) | 30 | 0.3% |
Common
| Value | Count | Frequency (%) |
| 0 | 28 | |
| 24 | ||
| : | 9 | 14.1% |
| - | 3 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8868 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 8189 | |
| V | 513 | 5.8% |
| 0 | 28 | 0.3% |
| 24 | 0.3% | |
| a | 15 | 0.2% |
| e | 9 | 0.1% |
| i | 9 | 0.1% |
| o | 9 | 0.1% |
| N | 9 | 0.1% |
| : | 9 | 0.1% |
| Other values (13) | 54 | 0.6% |
grupo_de_estadio_clinico_2
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 3959 |
| Missing (%) | 92.7% |
| Memory size | 33.5 KiB |
| I | |
|---|---|
| II | |
| 0 | |
| III | |
| IV | |
| Other values (2) |
Length
| Max length | 31 |
|---|---|
| Median length | 1 |
| Mean length | 1.9456869 |
| Min length | 1 |
Characters and Unicode
| Total characters | 609 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | IV |
|---|---|
| 2nd row | IV |
| 3rd row | II |
| 4th row | I |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| I | 108 | 2.5% |
| II | 63 | 1.5% |
| 0 | 49 | 1.1% |
| III | 40 | 0.9% |
| IV | 37 | 0.9% |
| Y: Na | 14 | 0.3% |
| X - nao foi possivel determinar | 2 | < 0.1% |
| (Missing) | 3959 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| i | 108 | |
| ii | 63 | |
| 0 | 49 | |
| iii | 40 | 11.9% |
| iv | 37 | 11.0% |
| y | 14 | 4.2% |
| na | 14 | 4.2% |
| x | 2 | 0.6% |
| 2 | 0.6% | |
| nao | 2 | 0.6% |
| Other values (3) | 6 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 391 | |
| 0 | 49 | 8.0% |
| V | 37 | 6.1% |
| 24 | 3.9% | |
| a | 18 | 3.0% |
| Y | 14 | 2.3% |
| : | 14 | 2.3% |
| N | 14 | 2.3% |
| e | 6 | 1.0% |
| i | 6 | 1.0% |
| Other values (13) | 36 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 458 | |
| Lowercase Letter | 62 | 10.2% |
| Decimal Number | 49 | 8.0% |
| Space Separator | 24 | 3.9% |
| Other Punctuation | 14 | 2.3% |
| Dash Punctuation | 2 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 18 | |
| e | 6 | 9.7% |
| i | 6 | 9.7% |
| o | 6 | 9.7% |
| n | 4 | 6.5% |
| s | 4 | 6.5% |
| r | 4 | 6.5% |
| f | 2 | 3.2% |
| p | 2 | 3.2% |
| v | 2 | 3.2% |
| Other values (4) | 8 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 391 | |
| V | 37 | 8.1% |
| Y | 14 | 3.1% |
| N | 14 | 3.1% |
| X | 2 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 49 |
Space Separator
| Value | Count | Frequency (%) |
| 24 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 14 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 520 | |
| Common | 89 | 14.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 391 | |
| V | 37 | 7.1% |
| a | 18 | 3.5% |
| Y | 14 | 2.7% |
| N | 14 | 2.7% |
| e | 6 | 1.2% |
| i | 6 | 1.2% |
| o | 6 | 1.2% |
| n | 4 | 0.8% |
| s | 4 | 0.8% |
| Other values (9) | 20 | 3.8% |
Common
| Value | Count | Frequency (%) |
| 0 | 49 | |
| 24 | ||
| : | 14 | 15.7% |
| - | 2 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 609 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 391 | |
| 0 | 49 | 8.0% |
| V | 37 | 6.1% |
| 24 | 3.9% | |
| a | 18 | 3.0% |
| Y | 14 | 2.3% |
| : | 14 | 2.3% |
| N | 14 | 2.3% |
| e | 6 | 1.0% |
| i | 6 | 1.0% |
| Other values (13) | 36 | 5.9% |
classificacao_tnm_clinico_t_1
Categorical
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| 2 | |
|---|---|
| 3 | |
| 1C | |
| 4B | |
| 4D | 135 |
| Other values (13) |
Length
| Max length | 31 |
|---|---|
| Median length | 1 |
| Mean length | 1.5414326 |
| Min length | 1 |
Characters and Unicode
| Total characters | 6585 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 3 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 1609 | |
| 3 | 854 | |
| 1C | 579 | 13.6% |
| 4B | 489 | 11.4% |
| 4D | 135 | 3.2% |
| 1B | 133 | 3.1% |
| 4 | 128 | 3.0% |
| 1 | 126 | 2.9% |
| 1A | 73 | 1.7% |
| 4C | 42 | 1.0% |
| Other values (8) | 104 | 2.4% |
Length
| Value | Count | Frequency (%) |
| 2 | 1609 | |
| 3 | 854 | |
| 1c | 579 | 13.2% |
| 4b | 489 | 11.1% |
| 4d | 135 | 3.1% |
| 1b | 133 | 3.0% |
| 4 | 128 | 2.9% |
| 1 | 126 | 2.9% |
| 1a | 73 | 1.7% |
| 4c | 42 | 1.0% |
| Other values (14) | 235 | 5.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1609 | |
| 1 | 917 | |
| 3 | 854 | |
| 4 | 817 | |
| C | 637 | 9.7% |
| B | 622 | 9.4% |
| D | 144 | 2.2% |
| 131 | 2.0% | |
| A | 96 | 1.5% |
| o | 72 | 1.1% |
| Other values (23) | 686 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4200 | |
| Uppercase Letter | 1632 | 24.8% |
| Lowercase Letter | 587 | 8.9% |
| Space Separator | 131 | 2.0% |
| Dash Punctuation | 24 | 0.4% |
| Other Punctuation | 11 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 72 | |
| i | 72 | |
| e | 72 | |
| a | 59 | |
| r | 48 | |
| n | 48 | |
| s | 48 | |
| m | 24 | 4.1% |
| l | 24 | 4.1% |
| t | 24 | 4.1% |
| Other values (4) | 96 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 637 | |
| B | 622 | |
| D | 144 | 8.8% |
| A | 96 | 5.9% |
| I | 43 | 2.6% |
| S | 37 | 2.3% |
| X | 24 | 1.5% |
| Y | 11 | 0.7% |
| N | 11 | 0.7% |
| M | 6 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1609 | |
| 1 | 917 | |
| 3 | 854 | |
| 4 | 817 | |
| 0 | 3 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 131 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 24 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4366 | |
| Latin | 2219 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 637 | |
| B | 622 | |
| D | 144 | 6.5% |
| A | 96 | 4.3% |
| o | 72 | 3.2% |
| i | 72 | 3.2% |
| e | 72 | 3.2% |
| a | 59 | 2.7% |
| r | 48 | 2.2% |
| n | 48 | 2.2% |
| Other values (15) | 349 |
Common
| Value | Count | Frequency (%) |
| 2 | 1609 | |
| 1 | 917 | |
| 3 | 854 | |
| 4 | 817 | |
| 131 | 3.0% | |
| - | 24 | 0.5% |
| : | 11 | 0.3% |
| 0 | 3 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6585 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1609 | |
| 1 | 917 | |
| 3 | 854 | |
| 4 | 817 | |
| C | 637 | 9.7% |
| B | 622 | 9.4% |
| D | 144 | 2.2% |
| 131 | 2.0% | |
| A | 96 | 1.5% |
| o | 72 | 1.1% |
| Other values (23) | 686 |
classificacao_tnm_clinico_t_2
Categorical
| Distinct | 19 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| 2 | |
|---|---|
| 1 | |
| IS | |
| 3 | |
| 1C | |
| Other values (14) |
Length
| Max length | 31 |
|---|---|
| Median length | 5 |
| Mean length | 3.7940379 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1400 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | X - nao foi possivel determinar |
|---|---|
| 2nd row | 3 |
| 3rd row | 2 |
| 4th row | 3A |
| 5th row | 1A |
Common Values
| Value | Count | Frequency (%) |
| 2 | 60 | 1.4% |
| 1 | 43 | 1.0% |
| IS | 38 | 0.9% |
| 3 | 36 | 0.8% |
| 1C | 30 | 0.7% |
| 1B | 27 | 0.6% |
| X - nao foi possivel determinar | 25 | 0.6% |
| 1A | 23 | 0.5% |
| CDIS | 21 | 0.5% |
| Y: Na | 16 | 0.4% |
| Other values (9) | 50 | 1.2% |
| (Missing) | 3903 |
Length
| Value | Count | Frequency (%) |
| 2 | 60 | 11.8% |
| 1 | 43 | 8.4% |
| is | 38 | 7.5% |
| 3 | 36 | 7.1% |
| 1c | 30 | 5.9% |
| 1b | 27 | 5.3% |
| x | 25 | 4.9% |
| 25 | 4.9% | |
| nao | 25 | 4.9% |
| foi | 25 | 4.9% |
| Other values (15) | 176 |
Most occurring characters
| Value | Count | Frequency (%) |
| 141 | 10.1% | |
| 1 | 123 | 8.8% |
| o | 75 | 5.4% |
| e | 75 | 5.4% |
| i | 75 | 5.4% |
| 2 | 68 | 4.9% |
| a | 66 | 4.7% |
| I | 59 | 4.2% |
| S | 59 | 4.2% |
| C | 51 | 3.6% |
| Other values (20) | 608 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 616 | |
| Uppercase Letter | 334 | |
| Decimal Number | 268 | |
| Space Separator | 141 | 10.1% |
| Dash Punctuation | 25 | 1.8% |
| Other Punctuation | 16 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 75 | |
| e | 75 | |
| i | 75 | |
| a | 66 | |
| s | 50 | |
| r | 50 | |
| n | 50 | |
| v | 25 | 4.1% |
| f | 25 | 4.1% |
| l | 25 | 4.1% |
| Other values (4) | 100 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 59 | |
| S | 59 | |
| C | 51 | |
| B | 41 | |
| A | 41 | |
| D | 26 | |
| X | 25 | |
| Y | 16 | 4.8% |
| N | 16 | 4.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 123 | |
| 2 | 68 | |
| 3 | 43 | 16.0% |
| 4 | 34 | 12.7% |
Space Separator
| Value | Count | Frequency (%) |
| 141 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 950 | |
| Common | 450 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 75 | 7.9% |
| e | 75 | 7.9% |
| i | 75 | 7.9% |
| a | 66 | 6.9% |
| I | 59 | 6.2% |
| S | 59 | 6.2% |
| C | 51 | 5.4% |
| s | 50 | 5.3% |
| r | 50 | 5.3% |
| n | 50 | 5.3% |
| Other values (13) | 340 |
Common
| Value | Count | Frequency (%) |
| 141 | ||
| 1 | 123 | |
| 2 | 68 | |
| 3 | 43 | 9.6% |
| 4 | 34 | 7.6% |
| - | 25 | 5.6% |
| : | 16 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1400 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 141 | 10.1% | |
| 1 | 123 | 8.8% |
| o | 75 | 5.4% |
| e | 75 | 5.4% |
| i | 75 | 5.4% |
| 2 | 68 | 4.9% |
| a | 66 | 4.7% |
| I | 59 | 4.2% |
| S | 59 | 4.2% |
| C | 51 | 3.6% |
| Other values (20) | 608 |
classificacao_tnm_clinico_n_1
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | |
| 2A | |
| 3 | 139 |
| Other values (6) |
Length
| Max length | 31 |
|---|---|
| Median length | 1 |
| Mean length | 1.3167135 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5625 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1778 | |
| 1 | 1395 | |
| 2 | 510 | 11.9% |
| 2A | 223 | 5.2% |
| 3 | 139 | 3.3% |
| 3A | 83 | 1.9% |
| 3C | 51 | 1.2% |
| 3B | 31 | 0.7% |
| X - nao foi possivel determinar | 30 | 0.7% |
| 2B | 21 | 0.5% |
Length
| Value | Count | Frequency (%) |
| 0 | 1778 | |
| 1 | 1395 | |
| 2 | 510 | 11.5% |
| 2a | 223 | 5.0% |
| 3 | 139 | 3.1% |
| 3a | 83 | 1.9% |
| 3c | 51 | 1.2% |
| 3b | 31 | 0.7% |
| foi | 30 | 0.7% |
| determinar | 30 | 0.7% |
| Other values (7) | 163 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1778 | |
| 1 | 1395 | |
| 2 | 754 | |
| A | 306 | 5.4% |
| 3 | 304 | 5.4% |
| 161 | 2.9% | |
| i | 90 | 1.6% |
| o | 90 | 1.6% |
| e | 90 | 1.6% |
| a | 71 | 1.3% |
| Other values (17) | 586 | 10.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4231 | |
| Lowercase Letter | 731 | 13.0% |
| Uppercase Letter | 461 | 8.2% |
| Space Separator | 161 | 2.9% |
| Dash Punctuation | 30 | 0.5% |
| Other Punctuation | 11 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 90 | |
| o | 90 | |
| e | 90 | |
| a | 71 | |
| n | 60 | |
| s | 60 | |
| r | 60 | |
| l | 30 | 4.1% |
| m | 30 | 4.1% |
| t | 30 | 4.1% |
| Other values (4) | 120 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 306 | |
| B | 52 | 11.3% |
| C | 51 | 11.1% |
| X | 30 | 6.5% |
| Y | 11 | 2.4% |
| N | 11 | 2.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1778 | |
| 1 | 1395 | |
| 2 | 754 | |
| 3 | 304 | 7.2% |
Space Separator
| Value | Count | Frequency (%) |
| 161 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4433 | |
| Latin | 1192 | 21.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 306 | |
| i | 90 | 7.6% |
| o | 90 | 7.6% |
| e | 90 | 7.6% |
| a | 71 | 6.0% |
| n | 60 | 5.0% |
| s | 60 | 5.0% |
| r | 60 | 5.0% |
| B | 52 | 4.4% |
| C | 51 | 4.3% |
| Other values (10) | 262 |
Common
| Value | Count | Frequency (%) |
| 0 | 1778 | |
| 1 | 1395 | |
| 2 | 754 | |
| 3 | 304 | 6.9% |
| 161 | 3.6% | |
| - | 30 | 0.7% |
| : | 11 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5625 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1778 | |
| 1 | 1395 | |
| 2 | 754 | |
| A | 306 | 5.4% |
| 3 | 304 | 5.4% |
| 161 | 2.9% | |
| i | 90 | 1.6% |
| o | 90 | 1.6% |
| e | 90 | 1.6% |
| a | 71 | 1.3% |
| Other values (17) | 586 | 10.4% |
classificacao_tnm_clinico_n_2
Categorical
IMBALANCE  MISSING 
| Distinct | 13 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| 0 | |
|---|---|
| 1 | |
| X - nao foi possivel determinar | |
| Y: Na | 16 |
| 2 | 8 |
| Other values (8) | 21 |
Length
| Max length | 31 |
|---|---|
| Median length | 1 |
| Mean length | 3.3279133 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1228 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | X - nao foi possivel determinar |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 258 | 6.0% |
| 1 | 40 | 0.9% |
| X - nao foi possivel determinar | 26 | 0.6% |
| Y: Na | 16 | 0.4% |
| 2 | 8 | 0.2% |
| 3 | 6 | 0.1% |
| 2A | 4 | 0.1% |
| 1A | 3 | 0.1% |
| 3A | 2 | < 0.1% |
| 1B | 2 | < 0.1% |
| Other values (3) | 4 | 0.1% |
| (Missing) | 3903 |
Length
| Value | Count | Frequency (%) |
| 0 | 258 | |
| 1 | 40 | 7.8% |
| x | 26 | 5.0% |
| 26 | 5.0% | |
| nao | 26 | 5.0% |
| foi | 26 | 5.0% |
| possivel | 26 | 5.0% |
| determinar | 26 | 5.0% |
| na | 16 | 3.1% |
| y | 16 | 3.1% |
| Other values (9) | 29 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 258 | |
| 146 | ||
| o | 78 | 6.4% |
| i | 78 | 6.4% |
| e | 78 | 6.4% |
| a | 68 | 5.5% |
| n | 52 | 4.2% |
| r | 52 | 4.2% |
| s | 52 | 4.2% |
| 1 | 45 | 3.7% |
| Other values (17) | 321 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 640 | |
| Decimal Number | 327 | |
| Space Separator | 146 | 11.9% |
| Uppercase Letter | 73 | 5.9% |
| Dash Punctuation | 26 | 2.1% |
| Other Punctuation | 16 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 78 | |
| i | 78 | |
| e | 78 | |
| a | 68 | |
| n | 52 | |
| r | 52 | |
| s | 52 | |
| m | 26 | 4.1% |
| d | 26 | 4.1% |
| l | 26 | 4.1% |
| Other values (4) | 104 |
Uppercase Letter
| Value | Count | Frequency (%) |
| X | 26 | |
| Y | 16 | |
| N | 16 | |
| A | 9 | 12.3% |
| B | 5 | 6.8% |
| C | 1 | 1.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 258 | |
| 1 | 45 | 13.8% |
| 2 | 14 | 4.3% |
| 3 | 10 | 3.1% |
Space Separator
| Value | Count | Frequency (%) |
| 146 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 26 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 713 | |
| Common | 515 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 78 | |
| i | 78 | |
| e | 78 | |
| a | 68 | 9.5% |
| n | 52 | 7.3% |
| r | 52 | 7.3% |
| s | 52 | 7.3% |
| m | 26 | 3.6% |
| d | 26 | 3.6% |
| l | 26 | 3.6% |
| Other values (10) | 177 |
Common
| Value | Count | Frequency (%) |
| 0 | 258 | |
| 146 | ||
| 1 | 45 | 8.7% |
| - | 26 | 5.0% |
| : | 16 | 3.1% |
| 2 | 14 | 2.7% |
| 3 | 10 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1228 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 258 | |
| 146 | ||
| o | 78 | 6.4% |
| i | 78 | 6.4% |
| e | 78 | 6.4% |
| a | 68 | 5.5% |
| n | 52 | 4.2% |
| r | 52 | 4.2% |
| s | 52 | 4.2% |
| 1 | 45 | 3.7% |
| Other values (17) | 321 |
classificacao_tnm_clinico_m_1
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| 0 | |
|---|---|
| 1 | |
| Y: Na | 11 |
| X - nao foi possivel determinar | 3 |
Length
| Max length | 31 |
|---|---|
| Median length | 1 |
| Mean length | 1.031367 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4406 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3712 | |
| 1 | 546 | 12.8% |
| Y: Na | 11 | 0.3% |
| X - nao foi possivel determinar | 3 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3712 | |
| 1 | 546 | 12.7% |
| y | 11 | 0.3% |
| na | 11 | 0.3% |
| x | 3 | 0.1% |
| 3 | 0.1% | |
| nao | 3 | 0.1% |
| foi | 3 | 0.1% |
| possivel | 3 | 0.1% |
| determinar | 3 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3712 | |
| 1 | 546 | 12.4% |
| 26 | 0.6% | |
| a | 17 | 0.4% |
| Y | 11 | 0.2% |
| : | 11 | 0.2% |
| N | 11 | 0.2% |
| e | 9 | 0.2% |
| o | 9 | 0.2% |
| i | 9 | 0.2% |
| Other values (12) | 45 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4258 | |
| Lowercase Letter | 83 | 1.9% |
| Space Separator | 26 | 0.6% |
| Uppercase Letter | 25 | 0.6% |
| Other Punctuation | 11 | 0.2% |
| Dash Punctuation | 3 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 17 | |
| e | 9 | |
| o | 9 | |
| i | 9 | |
| r | 6 | 7.2% |
| n | 6 | 7.2% |
| s | 6 | 7.2% |
| t | 3 | 3.6% |
| d | 3 | 3.6% |
| l | 3 | 3.6% |
| Other values (4) | 12 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 11 | |
| N | 11 | |
| X | 3 | 12.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3712 | |
| 1 | 546 | 12.8% |
Space Separator
| Value | Count | Frequency (%) |
| 26 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 11 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4298 | |
| Latin | 108 | 2.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 17 | |
| Y | 11 | |
| N | 11 | |
| e | 9 | |
| o | 9 | |
| i | 9 | |
| r | 6 | 5.6% |
| n | 6 | 5.6% |
| s | 6 | 5.6% |
| t | 3 | 2.8% |
| Other values (7) | 21 |
Common
| Value | Count | Frequency (%) |
| 0 | 3712 | |
| 1 | 546 | 12.7% |
| 26 | 0.6% | |
| : | 11 | 0.3% |
| - | 3 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4406 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3712 | |
| 1 | 546 | 12.4% |
| 26 | 0.6% | |
| a | 17 | 0.4% |
| Y | 11 | 0.2% |
| : | 11 | 0.2% |
| N | 11 | 0.2% |
| e | 9 | 0.2% |
| o | 9 | 0.2% |
| i | 9 | 0.2% |
| Other values (12) | 45 | 1.0% |
classificacao_tnm_clinico_m_2
Categorical
IMBALANCE  MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| 0 | |
|---|---|
| 1 | 30 |
| Y: Na | 16 |
| 1B | 7 |
| X - nao foi possivel determinar | 3 |
Length
| Max length | 31 |
|---|---|
| Median length | 1 |
| Mean length | 1.4417344 |
| Min length | 1 |
Characters and Unicode
| Total characters | 532 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 311 | 7.3% |
| 1 | 30 | 0.7% |
| Y: Na | 16 | 0.4% |
| 1B | 7 | 0.2% |
| X - nao foi possivel determinar | 3 | 0.1% |
| 1A | 2 | < 0.1% |
| (Missing) | 3903 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 311 | |
| 1 | 30 | 7.5% |
| y | 16 | 4.0% |
| na | 16 | 4.0% |
| 1b | 7 | 1.8% |
| x | 3 | 0.8% |
| 3 | 0.8% | |
| nao | 3 | 0.8% |
| foi | 3 | 0.8% |
| possivel | 3 | 0.8% |
| Other values (2) | 5 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 311 | |
| 1 | 39 | 7.3% |
| 31 | 5.8% | |
| a | 22 | 4.1% |
| Y | 16 | 3.0% |
| : | 16 | 3.0% |
| N | 16 | 3.0% |
| o | 9 | 1.7% |
| i | 9 | 1.7% |
| e | 9 | 1.7% |
| Other values (14) | 54 | 10.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 350 | |
| Lowercase Letter | 88 | 16.5% |
| Uppercase Letter | 44 | 8.3% |
| Space Separator | 31 | 5.8% |
| Other Punctuation | 16 | 3.0% |
| Dash Punctuation | 3 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 22 | |
| o | 9 | |
| i | 9 | |
| e | 9 | |
| n | 6 | 6.8% |
| s | 6 | 6.8% |
| r | 6 | 6.8% |
| l | 3 | 3.4% |
| m | 3 | 3.4% |
| t | 3 | 3.4% |
| Other values (4) | 12 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 16 | |
| N | 16 | |
| B | 7 | |
| X | 3 | 6.8% |
| A | 2 | 4.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 311 | |
| 1 | 39 | 11.1% |
Space Separator
| Value | Count | Frequency (%) |
| 31 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 16 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 400 | |
| Latin | 132 | 24.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 22 | |
| Y | 16 | |
| N | 16 | |
| o | 9 | 6.8% |
| i | 9 | 6.8% |
| e | 9 | 6.8% |
| B | 7 | 5.3% |
| n | 6 | 4.5% |
| s | 6 | 4.5% |
| r | 6 | 4.5% |
| Other values (9) | 26 |
Common
| Value | Count | Frequency (%) |
| 0 | 311 | |
| 1 | 39 | 9.8% |
| 31 | 7.8% | |
| : | 16 | 4.0% |
| - | 3 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 532 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 311 | |
| 1 | 39 | 7.3% |
| 31 | 5.8% | |
| a | 22 | 4.1% |
| Y | 16 | 3.0% |
| : | 16 | 3.0% |
| N | 16 | 3.0% |
| o | 9 | 1.7% |
| i | 9 | 1.7% |
| e | 9 | 1.7% |
| Other values (14) | 54 | 10.2% |
metastase_ao_diagnostico_cid_o_1_1
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 3600 |
| Missing (%) | 84.3% |
| Memory size | 33.5 KiB |
| C34 - Bronquios e Pulmoes | |
|---|---|
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | |
| C40 - Ossos e Cartilagens Articulares Dos Membros | |
| Other values (9) |
Length
| Max length | 64 |
|---|---|
| Median length | 59 |
| Mean length | 45.309524 |
| Min length | 12 |
Characters and Unicode
| Total characters | 30448 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | C41 - Ossos e Das Cartilagens Articulares de Outras Localizações |
|---|---|
| 2nd row | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
| 3rd row | C34 - Bronquios e Pulmoes |
| 4th row | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
| 5th row | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
Common Values
| Value | Count | Frequency (%) |
| C34 - Bronquios e Pulmoes | 196 | 4.6% |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 151 | 3.5% |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 107 | 2.5% |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 81 | 1.9% |
| C40 - Ossos e Cartilagens Articulares Dos Membros | 81 | 1.9% |
| C38 - Coração, Mediastino e Pleura, | 20 | 0.5% |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 14 | 0.3% |
| C71 - Encefalo | 8 | 0.2% |
| C49 - Tecido Conjuntivo e de Outros Tecidos Moles | 6 | 0.1% |
| C44 - Pele nao-melanoma | 3 | 0.1% |
| Other values (4) | 5 | 0.1% |
| (Missing) | 3600 |
Length
| Value | Count | Frequency (%) |
| 672 | 13.2% | |
| e | 657 | 12.9% |
| das | 258 | 5.1% |
| ossos | 232 | 4.6% |
| cartilagens | 232 | 4.6% |
| articulares | 232 | 4.6% |
| c34 | 196 | 3.9% |
| bronquios | 196 | 3.9% |
| pulmoes | 196 | 3.9% |
| dos | 162 | 3.2% |
| Other values (46) | 2044 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4405 | ||
| s | 2986 | 9.8% |
| a | 2268 | 7.4% |
| e | 2233 | 7.3% |
| i | 1861 | 6.1% |
| o | 1781 | 5.8% |
| r | 1510 | 5.0% |
| l | 1057 | 3.5% |
| t | 989 | 3.2% |
| C | 931 | 3.1% |
| Other values (51) | 10427 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20318 | |
| Space Separator | 4405 | 14.5% |
| Uppercase Letter | 3558 | 11.7% |
| Decimal Number | 1344 | 4.4% |
| Dash Punctuation | 782 | 2.6% |
| Other Punctuation | 41 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 2986 | |
| a | 2268 | |
| e | 2233 | |
| i | 1861 | |
| o | 1781 | |
| r | 1510 | 7.4% |
| l | 1057 | 5.2% |
| t | 989 | 4.9% |
| u | 889 | 4.4% |
| n | 854 | 4.2% |
| Other values (21) | 3890 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 931 | |
| D | 420 | |
| O | 391 | |
| B | 304 | 8.5% |
| P | 233 | 6.5% |
| L | 232 | 6.5% |
| A | 232 | 6.5% |
| M | 121 | 3.4% |
| I | 107 | 3.0% |
| V | 107 | 3.0% |
| Other values (7) | 480 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 455 | |
| 3 | 216 | |
| 2 | 215 | |
| 7 | 171 | 12.7% |
| 1 | 160 | 11.9% |
| 0 | 81 | 6.0% |
| 8 | 35 | 2.6% |
| 9 | 6 | 0.4% |
| 6 | 3 | 0.2% |
| 5 | 2 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 4405 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 782 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 41 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23876 | |
| Common | 6572 | 21.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 2986 | |
| a | 2268 | 9.5% |
| e | 2233 | 9.4% |
| i | 1861 | 7.8% |
| o | 1781 | 7.5% |
| r | 1510 | 6.3% |
| l | 1057 | 4.4% |
| t | 989 | 4.1% |
| C | 931 | 3.9% |
| u | 889 | 3.7% |
| Other values (38) | 7371 |
Common
| Value | Count | Frequency (%) |
| 4405 | ||
| - | 782 | 11.9% |
| 4 | 455 | 6.9% |
| 3 | 216 | 3.3% |
| 2 | 215 | 3.3% |
| 7 | 171 | 2.6% |
| 1 | 160 | 2.4% |
| 0 | 81 | 1.2% |
| , | 41 | 0.6% |
| 8 | 35 | 0.5% |
| Other values (3) | 11 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29539 | |
| None | 909 | 3.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4405 | ||
| s | 2986 | 10.1% |
| a | 2268 | 7.7% |
| e | 2233 | 7.6% |
| i | 1861 | 6.3% |
| o | 1781 | 6.0% |
| r | 1510 | 5.1% |
| l | 1057 | 3.6% |
| t | 989 | 3.3% |
| C | 931 | 3.2% |
| Other values (43) | 9518 |
None
| Value | Count | Frequency (%) |
| á | 269 | |
| ç | 171 | |
| õ | 151 | |
| Ã | 107 | 11.8% |
| ã | 101 | 11.1% |
| â | 81 | 8.9% |
| ô | 28 | 3.1% |
| é | 1 | 0.1% |
metastase_ao_diagnostico_cid_o_1_2
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | 23.1% |
| Missing | 4233 |
| Missing (%) | 99.1% |
| Memory size | 33.5 KiB |
| C34 - Bronquios e Pulmoes | |
|---|---|
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
| C40 - Ossos e Cartilagens Articulares Dos Membros | |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | |
| C71 - Encefalo | |
| Other values (4) |
Length
| Max length | 64 |
|---|---|
| Median length | 52 |
| Mean length | 41.846154 |
| Min length | 13 |
Characters and Unicode
| Total characters | 1632 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 5.1% |
Sample
| 1st row | C34 - Bronquios e Pulmoes |
|---|---|
| 2nd row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
| 3rd row | C71 - Encefalo |
| 4th row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
| 5th row | C15 - Esofago |
Common Values
| Value | Count | Frequency (%) |
| C34 - Bronquios e Pulmoes | 8 | 0.2% |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 8 | 0.2% |
| C40 - Ossos e Cartilagens Articulares Dos Membros | 7 | 0.2% |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 5 | 0.1% |
| C71 - Encefalo | 4 | 0.1% |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 3 | 0.1% |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 2 | < 0.1% |
| C15 - Esofago | 1 | < 0.1% |
| C74 - Glândula Supra-renal (Glândula Adrenal) | 1 | < 0.1% |
| (Missing) | 4233 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 39 | 14.2% | |
| e | 33 | 12.0% |
| dos | 12 | 4.4% |
| das | 11 | 4.0% |
| articulares | 10 | 3.6% |
| cartilagens | 10 | 3.6% |
| ossos | 10 | 3.6% |
| c34 | 8 | 2.9% |
| intra-hepáticas | 8 | 2.9% |
| biliares | 8 | 2.9% |
| Other values (31) | 125 |
Most occurring characters
| Value | Count | Frequency (%) |
| 235 | ||
| s | 146 | 8.9% |
| e | 116 | 7.1% |
| a | 115 | 7.0% |
| i | 103 | 6.3% |
| o | 99 | 6.1% |
| r | 78 | 4.8% |
| l | 56 | 3.4% |
| n | 53 | 3.2% |
| t | 50 | 3.1% |
| Other values (46) | 581 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1074 | |
| Space Separator | 235 | 14.4% |
| Uppercase Letter | 195 | 11.9% |
| Decimal Number | 78 | 4.8% |
| Dash Punctuation | 48 | 2.9% |
| Open Punctuation | 1 | 0.1% |
| Close Punctuation | 1 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 146 | |
| e | 116 | |
| a | 115 | |
| i | 103 | |
| o | 99 | |
| r | 78 | 7.3% |
| l | 56 | 5.2% |
| n | 53 | 4.9% |
| t | 50 | 4.7% |
| c | 47 | 4.4% |
| Other values (17) | 211 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 49 | |
| D | 23 | |
| B | 16 | 8.2% |
| O | 13 | 6.7% |
| A | 11 | 5.6% |
| E | 10 | 5.1% |
| P | 10 | 5.1% |
| M | 9 | 4.6% |
| F | 8 | 4.1% |
| L | 8 | 4.1% |
| Other values (7) | 38 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 21 | |
| 2 | 16 | |
| 7 | 15 | |
| 1 | 8 | 10.3% |
| 3 | 8 | 10.3% |
| 0 | 7 | 9.0% |
| 8 | 2 | 2.6% |
| 5 | 1 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| 235 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 48 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1269 | |
| Common | 363 | 22.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 146 | 11.5% |
| e | 116 | 9.1% |
| a | 115 | 9.1% |
| i | 103 | 8.1% |
| o | 99 | 7.8% |
| r | 78 | 6.1% |
| l | 56 | 4.4% |
| n | 53 | 4.2% |
| t | 50 | 3.9% |
| C | 49 | 3.9% |
| Other values (34) | 404 |
Common
| Value | Count | Frequency (%) |
| 235 | ||
| - | 48 | 13.2% |
| 4 | 21 | 5.8% |
| 2 | 16 | 4.4% |
| 7 | 15 | 4.1% |
| 1 | 8 | 2.2% |
| 3 | 8 | 2.2% |
| 0 | 7 | 1.9% |
| 8 | 2 | 0.6% |
| 5 | 1 | 0.3% |
| Other values (2) | 2 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1584 | |
| None | 48 | 2.9% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 235 | ||
| s | 146 | 9.2% |
| e | 116 | 7.3% |
| a | 115 | 7.3% |
| i | 103 | 6.5% |
| o | 99 | 6.2% |
| r | 78 | 4.9% |
| l | 56 | 3.5% |
| n | 53 | 3.3% |
| t | 50 | 3.2% |
| Other values (39) | 533 |
None
| Value | Count | Frequency (%) |
| á | 18 | |
| Ã | 8 | |
| â | 7 | 14.6% |
| ã | 5 | 10.4% |
| ô | 4 | 8.3% |
| ç | 3 | 6.2% |
| õ | 3 | 6.2% |
metastase_ao_diagnostico_cid_o_2_1
Categorical
| Distinct | 17 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 3898 |
| Missing (%) | 91.2% |
| Memory size | 33.5 KiB |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
|---|---|
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
| C34 - Bronquios e Pulmoes | |
| C40 - Ossos e Cartilagens Articulares Dos Membros | |
| Other values (12) |
Length
| Max length | 64 |
|---|---|
| Median length | 59 |
| Mean length | 49.264706 |
| Min length | 10 |
Characters and Unicode
| Total characters | 18425 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
|---|---|
| 2nd row | C34 - Bronquios e Pulmoes |
| 3rd row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
| 4th row | C40 - Ossos e Cartilagens Articulares Dos Membros |
| 5th row | C71 - Encefalo |
Common Values
| Value | Count | Frequency (%) |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 106 | 2.5% |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 68 | 1.6% |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 53 | 1.2% |
| C34 - Bronquios e Pulmoes | 52 | 1.2% |
| C40 - Ossos e Cartilagens Articulares Dos Membros | 35 | 0.8% |
| C38 - Coração, Mediastino e Pleura, | 23 | 0.5% |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 11 | 0.3% |
| C71 - Encefalo | 8 | 0.2% |
| C74 - Glândula Supra-renal (Glândula Adrenal) | 4 | 0.1% |
| C49 - Tecido Conjuntivo e de Outros Tecidos Moles | 3 | 0.1% |
| Other values (7) | 11 | 0.3% |
| (Missing) | 3898 |
Length
| Value | Count | Frequency (%) |
| 374 | 12.5% | |
| e | 353 | 11.8% |
| das | 159 | 5.3% |
| ossos | 141 | 4.7% |
| cartilagens | 141 | 4.7% |
| articulares | 141 | 4.7% |
| de | 111 | 3.7% |
| outras | 108 | 3.6% |
| c41 | 106 | 3.5% |
| localizações | 106 | 3.5% |
| Other values (56) | 1255 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2621 | ||
| s | 1750 | 9.5% |
| a | 1498 | 8.1% |
| e | 1316 | 7.1% |
| i | 1177 | 6.4% |
| o | 976 | 5.3% |
| r | 896 | 4.9% |
| l | 640 | 3.5% |
| t | 631 | 3.4% |
| c | 603 | 3.3% |
| Other values (52) | 6317 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12436 | |
| Space Separator | 2621 | 14.2% |
| Uppercase Letter | 2134 | 11.6% |
| Decimal Number | 748 | 4.1% |
| Dash Punctuation | 432 | 2.3% |
| Other Punctuation | 46 | 0.2% |
| Open Punctuation | 4 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1750 | |
| a | 1498 | |
| e | 1316 | |
| i | 1177 | |
| o | 976 | |
| r | 896 | 7.2% |
| l | 640 | 5.1% |
| t | 631 | 5.1% |
| c | 603 | 4.8% |
| n | 540 | 4.3% |
| Other values (20) | 2409 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 541 | |
| D | 262 | |
| O | 253 | |
| L | 174 | 8.2% |
| A | 145 | 6.8% |
| B | 105 | 4.9% |
| P | 90 | 4.2% |
| E | 80 | 3.7% |
| G | 79 | 3.7% |
| M | 76 | 3.6% |
| Other values (7) | 329 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 213 | |
| 7 | 152 | |
| 1 | 114 | |
| 2 | 108 | |
| 3 | 75 | 10.0% |
| 0 | 40 | 5.3% |
| 8 | 34 | 4.5% |
| 5 | 8 | 1.1% |
| 9 | 3 | 0.4% |
| 6 | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2621 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 432 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 46 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14570 | |
| Common | 3855 | 20.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 1750 | 12.0% |
| a | 1498 | 10.3% |
| e | 1316 | 9.0% |
| i | 1177 | 8.1% |
| o | 976 | 6.7% |
| r | 896 | 6.1% |
| l | 640 | 4.4% |
| t | 631 | 4.3% |
| c | 603 | 4.1% |
| C | 541 | 3.7% |
| Other values (37) | 4542 |
Common
| Value | Count | Frequency (%) |
| 2621 | ||
| - | 432 | 11.2% |
| 4 | 213 | 5.5% |
| 7 | 152 | 3.9% |
| 1 | 114 | 3.0% |
| 2 | 108 | 2.8% |
| 3 | 75 | 1.9% |
| , | 46 | 1.2% |
| 0 | 40 | 1.0% |
| 8 | 34 | 0.9% |
| Other values (5) | 20 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17755 | |
| None | 670 | 3.6% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2621 | ||
| s | 1750 | 9.9% |
| a | 1498 | 8.4% |
| e | 1316 | 7.4% |
| i | 1177 | 6.6% |
| o | 976 | 5.5% |
| r | 896 | 5.0% |
| l | 640 | 3.6% |
| t | 631 | 3.6% |
| c | 603 | 3.4% |
| Other values (44) | 5647 |
None
| Value | Count | Frequency (%) |
| á | 189 | |
| ç | 129 | |
| õ | 106 | |
| ã | 91 | |
| â | 78 | |
| Ã | 53 | 7.9% |
| ô | 22 | 3.3% |
| ó | 2 | 0.3% |
metastase_ao_diagnostico_cid_o_2_2
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 40.0% |
| Missing | 4257 |
| Missing (%) | 99.6% |
| Memory size | 33.5 KiB |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | |
|---|---|
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
| C34 - Bronquios e Pulmoes | |
| C74 - Glândula Supra-renal (Glândula Adrenal) | |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
Length
| Max length | 64 |
|---|---|
| Median length | 59 |
| Mean length | 52 |
| Min length | 25 |
Characters and Unicode
| Total characters | 780 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 6.7% |
Sample
| 1st row | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
|---|---|
| 2nd row | C34 - Bronquios e Pulmoes |
| 3rd row | C34 - Bronquios e Pulmoes |
| 4th row | C74 - Glândula Supra-renal (Glândula Adrenal) |
| 5th row | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
Common Values
| Value | Count | Frequency (%) |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 4 | 0.1% |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 4 | 0.1% |
| C34 - Bronquios e Pulmoes | 2 | < 0.1% |
| C74 - Glândula Supra-renal (Glândula Adrenal) | 2 | < 0.1% |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 2 | < 0.1% |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 1 | < 0.1% |
| (Missing) | 4257 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 15 | 12.2% | |
| e | 13 | 10.6% |
| das | 6 | 4.9% |
| c77 | 4 | 3.3% |
| ossos | 4 | 3.3% |
| glândula | 4 | 3.3% |
| localizações | 4 | 3.3% |
| outras | 4 | 3.3% |
| de | 4 | 3.3% |
| articulares | 4 | 3.3% |
| Other values (25) | 61 |
Most occurring characters
| Value | Count | Frequency (%) |
| 108 | 13.8% | |
| a | 62 | 7.9% |
| s | 62 | 7.9% |
| e | 52 | 6.7% |
| i | 51 | 6.5% |
| o | 39 | 5.0% |
| r | 35 | 4.5% |
| l | 33 | 4.2% |
| n | 30 | 3.8% |
| c | 27 | 3.5% |
| Other values (43) | 281 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 530 | |
| Space Separator | 108 | 13.8% |
| Uppercase Letter | 89 | 11.4% |
| Decimal Number | 30 | 3.8% |
| Dash Punctuation | 19 | 2.4% |
| Open Punctuation | 2 | 0.3% |
| Close Punctuation | 2 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 62 | |
| s | 62 | |
| e | 52 | |
| i | 51 | |
| o | 39 | 7.4% |
| r | 35 | 6.6% |
| l | 33 | 6.2% |
| n | 30 | 5.7% |
| c | 27 | 5.1% |
| d | 23 | 4.3% |
| Other values (16) | 116 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 19 | |
| D | 10 | |
| O | 8 | |
| L | 8 | |
| G | 8 | |
| S | 6 | 6.7% |
| A | 6 | 6.7% |
| E | 4 | 4.5% |
| N | 4 | 4.5% |
| B | 4 | 4.5% |
| Other values (7) | 12 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 10 | |
| 4 | 9 | |
| 1 | 4 | 13.3% |
| 2 | 4 | 13.3% |
| 3 | 2 | 6.7% |
| 8 | 1 | 3.3% |
Space Separator
| Value | Count | Frequency (%) |
| 108 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 619 | |
| Common | 161 | 20.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 62 | 10.0% |
| s | 62 | 10.0% |
| e | 52 | 8.4% |
| i | 51 | 8.2% |
| o | 39 | 6.3% |
| r | 35 | 5.7% |
| l | 33 | 5.3% |
| n | 30 | 4.8% |
| c | 27 | 4.4% |
| d | 23 | 3.7% |
| Other values (33) | 205 |
Common
| Value | Count | Frequency (%) |
| 108 | ||
| - | 19 | 11.8% |
| 7 | 10 | 6.2% |
| 4 | 9 | 5.6% |
| 1 | 4 | 2.5% |
| 2 | 4 | 2.5% |
| ( | 2 | 1.2% |
| ) | 2 | 1.2% |
| 3 | 2 | 1.2% |
| 8 | 1 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 746 | |
| None | 34 | 4.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 108 | ||
| a | 62 | 8.3% |
| s | 62 | 8.3% |
| e | 52 | 7.0% |
| i | 51 | 6.8% |
| o | 39 | 5.2% |
| r | 35 | 4.7% |
| l | 33 | 4.4% |
| n | 30 | 4.0% |
| c | 27 | 3.6% |
| Other values (36) | 247 |
None
| Value | Count | Frequency (%) |
| á | 10 | |
| â | 8 | |
| ã | 4 | 11.8% |
| ç | 4 | 11.8% |
| õ | 4 | 11.8% |
| Ã | 2 | 5.9% |
| ô | 2 | 5.9% |
metastase_ao_diagnostico_cid_o_3_1
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | 8.0% |
| Missing | 4097 |
| Missing (%) | 95.9% |
| Memory size | 33.5 KiB |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | |
|---|---|
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
| C34 - Bronquios e Pulmoes | |
| C40 - Ossos e Cartilagens Articulares Dos Membros | |
| Other values (9) |
Length
| Max length | 64 |
|---|---|
| Median length | 52 |
| Mean length | 48.914286 |
| Min length | 14 |
Characters and Unicode
| Total characters | 8560 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
|---|---|
| 2nd row | C64 - Rim, Exceto Pelve Renal |
| 3rd row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
| 4th row | C71 - Encefalo |
| 5th row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
Common Values
| Value | Count | Frequency (%) |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 42 | 1.0% |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 39 | 0.9% |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 22 | 0.5% |
| C34 - Bronquios e Pulmoes | 21 | 0.5% |
| C40 - Ossos e Cartilagens Articulares Dos Membros | 15 | 0.4% |
| C38 - Coração, Mediastino e Pleura, | 12 | 0.3% |
| C71 - Encefalo | 7 | 0.2% |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 7 | 0.2% |
| C74 - Glândula Supra-renal (Glândula Adrenal) | 3 | 0.1% |
| C42 - Sistema hematopoiético e reticuloendotelial | 2 | < 0.1% |
| Other values (4) | 5 | 0.1% |
| (Missing) | 4097 |
Length
| Value | Count | Frequency (%) |
| 175 | 12.6% | |
| e | 162 | 11.7% |
| das | 61 | 4.4% |
| dos | 57 | 4.1% |
| articulares | 54 | 3.9% |
| cartilagens | 54 | 3.9% |
| ossos | 54 | 3.9% |
| linfáticos | 42 | 3.0% |
| c77 | 42 | 3.0% |
| especificada | 42 | 3.0% |
| Other values (50) | 642 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1210 | ||
| s | 751 | 8.8% |
| a | 644 | 7.5% |
| e | 607 | 7.1% |
| i | 584 | 6.8% |
| o | 493 | 5.8% |
| r | 382 | 4.5% |
| c | 307 | 3.6% |
| l | 286 | 3.3% |
| t | 282 | 3.3% |
| Other values (53) | 3014 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5780 | |
| Space Separator | 1210 | 14.1% |
| Uppercase Letter | 989 | 11.6% |
| Decimal Number | 350 | 4.1% |
| Dash Punctuation | 200 | 2.3% |
| Other Punctuation | 25 | 0.3% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 751 | |
| a | 644 | |
| e | 607 | |
| i | 584 | |
| o | 493 | |
| r | 382 | 6.6% |
| c | 307 | 5.3% |
| l | 286 | 4.9% |
| t | 282 | 4.9% |
| n | 276 | 4.8% |
| Other values (21) | 1168 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 243 | |
| D | 118 | |
| O | 95 | 9.6% |
| L | 81 | 8.2% |
| A | 57 | 5.8% |
| E | 51 | 5.2% |
| G | 48 | 4.9% |
| S | 47 | 4.8% |
| B | 43 | 4.3% |
| P | 42 | 4.2% |
| Other values (7) | 164 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 94 | |
| 4 | 90 | |
| 2 | 47 | |
| 1 | 47 | |
| 3 | 33 | 9.4% |
| 8 | 19 | 5.4% |
| 0 | 15 | 4.3% |
| 9 | 2 | 0.6% |
| 6 | 2 | 0.6% |
| 5 | 1 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 1210 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 200 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 25 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6769 | |
| Common | 1791 | 20.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 751 | 11.1% |
| a | 644 | 9.5% |
| e | 607 | 9.0% |
| i | 584 | 8.6% |
| o | 493 | 7.3% |
| r | 382 | 5.6% |
| c | 307 | 4.5% |
| l | 286 | 4.2% |
| t | 282 | 4.2% |
| n | 276 | 4.1% |
| Other values (38) | 2157 |
Common
| Value | Count | Frequency (%) |
| 1210 | ||
| - | 200 | 11.2% |
| 7 | 94 | 5.2% |
| 4 | 90 | 5.0% |
| 2 | 47 | 2.6% |
| 1 | 47 | 2.6% |
| 3 | 33 | 1.8% |
| , | 25 | 1.4% |
| 8 | 19 | 1.1% |
| 0 | 15 | 0.8% |
| Other values (5) | 11 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8224 | |
| None | 336 | 3.9% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1210 | ||
| s | 751 | 9.1% |
| a | 644 | 7.8% |
| e | 607 | 7.4% |
| i | 584 | 7.1% |
| o | 493 | 6.0% |
| r | 382 | 4.6% |
| c | 307 | 3.7% |
| l | 286 | 3.5% |
| t | 282 | 3.4% |
| Other values (45) | 2678 |
None
| Value | Count | Frequency (%) |
| á | 106 | |
| ã | 54 | |
| ç | 51 | |
| â | 48 | |
| õ | 39 | 11.6% |
| Ã | 22 | 6.5% |
| ô | 14 | 4.2% |
| é | 2 | 0.6% |
metastase_ao_diagnostico_cid_o_3_2
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 4266 |
| Missing (%) | 99.9% |
| Memory size | 33.5 KiB |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
|---|---|
| C49 - Tecido Conjuntivo e de Outros Tecidos Moles | |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | |
| C34 - Bronquios e Pulmoes |
Length
| Max length | 64 |
|---|---|
| Median length | 61.5 |
| Mean length | 54.166667 |
| Min length | 25 |
Characters and Unicode
| Total characters | 325 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 50.0% |
Sample
| 1st row | C49 - Tecido Conjuntivo e de Outros Tecidos Moles |
|---|---|
| 2nd row | C41 - Ossos e Das Cartilagens Articulares de Outras Localizações |
| 3rd row | C41 - Ossos e Das Cartilagens Articulares de Outras Localizações |
| 4th row | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
| 5th row | C34 - Bronquios e Pulmoes |
Common Values
| Value | Count | Frequency (%) |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 3 | 0.1% |
| C49 - Tecido Conjuntivo e de Outros Tecidos Moles | 1 | < 0.1% |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 1 | < 0.1% |
| C34 - Bronquios e Pulmoes | 1 | < 0.1% |
| (Missing) | 4266 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 6 | 11.3% | |
| e | 6 | 11.3% |
| de | 4 | 7.5% |
| c41 | 3 | 5.7% |
| articulares | 3 | 5.7% |
| localizações | 3 | 5.7% |
| outras | 3 | 5.7% |
| cartilagens | 3 | 5.7% |
| das | 3 | 5.7% |
| ossos | 3 | 5.7% |
| Other values (16) | 16 |
Most occurring characters
| Value | Count | Frequency (%) |
| 47 | ||
| s | 33 | 10.2% |
| e | 25 | 7.7% |
| a | 24 | 7.4% |
| o | 19 | 5.8% |
| i | 19 | 5.8% |
| r | 15 | 4.6% |
| t | 12 | 3.7% |
| c | 12 | 3.7% |
| l | 12 | 3.7% |
| Other values (35) | 107 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 223 | |
| Space Separator | 47 | 14.5% |
| Uppercase Letter | 37 | 11.4% |
| Decimal Number | 12 | 3.7% |
| Dash Punctuation | 6 | 1.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 33 | |
| e | 25 | |
| a | 24 | |
| o | 19 | |
| i | 19 | |
| r | 15 | 6.7% |
| t | 12 | 5.4% |
| c | 12 | 5.4% |
| l | 12 | 5.4% |
| u | 11 | 4.9% |
| Other values (15) | 41 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 10 | |
| O | 7 | |
| D | 4 | 10.8% |
| L | 4 | 10.8% |
| A | 3 | 8.1% |
| T | 2 | 5.4% |
| P | 1 | 2.7% |
| B | 1 | 2.7% |
| G | 1 | 2.7% |
| E | 1 | 2.7% |
| Other values (3) | 3 | 8.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 5 | |
| 1 | 3 | |
| 7 | 2 | 16.7% |
| 3 | 1 | 8.3% |
| 9 | 1 | 8.3% |
Space Separator
| Value | Count | Frequency (%) |
| 47 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 260 | |
| Common | 65 | 20.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 33 | |
| e | 25 | 9.6% |
| a | 24 | 9.2% |
| o | 19 | 7.3% |
| i | 19 | 7.3% |
| r | 15 | 5.8% |
| t | 12 | 4.6% |
| c | 12 | 4.6% |
| l | 12 | 4.6% |
| u | 11 | 4.2% |
| Other values (28) | 78 |
Common
| Value | Count | Frequency (%) |
| 47 | ||
| - | 6 | 9.2% |
| 4 | 5 | 7.7% |
| 1 | 3 | 4.6% |
| 7 | 2 | 3.1% |
| 3 | 1 | 1.5% |
| 9 | 1 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 315 | |
| None | 10 | 3.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 47 | ||
| s | 33 | 10.5% |
| e | 25 | 7.9% |
| a | 24 | 7.6% |
| o | 19 | 6.0% |
| i | 19 | 6.0% |
| r | 15 | 4.8% |
| t | 12 | 3.8% |
| c | 12 | 3.8% |
| l | 12 | 3.8% |
| Other values (30) | 97 |
None
| Value | Count | Frequency (%) |
| ç | 3 | |
| õ | 3 | |
| á | 2 | |
| â | 1 | 10.0% |
| ã | 1 | 10.0% |
metastase_ao_diagnostico_cid_o_4_1
Categorical
| Distinct | 13 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 4205 |
| Missing (%) | 98.4% |
| Memory size | 33.5 KiB |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
|---|---|
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | |
| C34 - Bronquios e Pulmoes | |
| C40 - Ossos e Cartilagens Articulares Dos Membros | |
| Other values (8) |
Length
| Max length | 64 |
|---|---|
| Median length | 59 |
| Mean length | 50.253731 |
| Min length | 12 |
Characters and Unicode
| Total characters | 3367 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 6.0% |
Sample
| 1st row | C38 - Coração, Mediastino e Pleura, |
|---|---|
| 2nd row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
| 3rd row | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
| 4th row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
| 5th row | C48 - Tecidos Moles do Retroperitônio e do Peritônio |
Common Values
| Value | Count | Frequency (%) |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 21 | 0.5% |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 11 | 0.3% |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 9 | 0.2% |
| C34 - Bronquios e Pulmoes | 6 | 0.1% |
| C40 - Ossos e Cartilagens Articulares Dos Membros | 5 | 0.1% |
| C74 - Glândula Supra-renal (Glândula Adrenal) | 4 | 0.1% |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 3 | 0.1% |
| C38 - Coração, Mediastino e Pleura, | 2 | < 0.1% |
| C71 - Encefalo | 2 | < 0.1% |
| C75 - Outras Glândulas Endócrinas e de Estruturas Relacionadas | 1 | < 0.1% |
| Other values (3) | 3 | 0.1% |
| (Missing) | 4205 |
Length
| Value | Count | Frequency (%) |
| 67 | 12.4% | |
| e | 59 | 10.9% |
| das | 32 | 5.9% |
| ossos | 26 | 4.8% |
| cartilagens | 26 | 4.8% |
| articulares | 26 | 4.8% |
| de | 23 | 4.2% |
| outras | 22 | 4.1% |
| c41 | 21 | 3.9% |
| localizações | 21 | 3.9% |
| Other values (46) | 219 |
Most occurring characters
| Value | Count | Frequency (%) |
| 475 | ||
| s | 312 | 9.3% |
| a | 292 | 8.7% |
| e | 240 | 7.1% |
| i | 203 | 6.0% |
| r | 172 | 5.1% |
| o | 159 | 4.7% |
| l | 134 | 4.0% |
| t | 120 | 3.6% |
| c | 103 | 3.1% |
| Other values (53) | 1157 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2277 | |
| Space Separator | 475 | 14.1% |
| Uppercase Letter | 387 | 11.5% |
| Decimal Number | 134 | 4.0% |
| Dash Punctuation | 82 | 2.4% |
| Close Punctuation | 4 | 0.1% |
| Open Punctuation | 4 | 0.1% |
| Other Punctuation | 4 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 312 | |
| a | 292 | |
| e | 240 | |
| i | 203 | |
| r | 172 | 7.6% |
| o | 159 | 7.0% |
| l | 134 | 5.9% |
| t | 120 | 5.3% |
| c | 103 | 4.5% |
| n | 102 | 4.5% |
| Other values (21) | 440 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 96 | |
| O | 50 | |
| D | 46 | |
| A | 30 | 7.8% |
| L | 30 | 7.8% |
| B | 18 | 4.7% |
| G | 18 | 4.7% |
| E | 13 | 3.4% |
| S | 13 | 3.4% |
| P | 11 | 2.8% |
| Other values (7) | 62 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 40 | |
| 7 | 26 | |
| 1 | 23 | |
| 2 | 22 | |
| 3 | 8 | 6.0% |
| 8 | 5 | 3.7% |
| 0 | 5 | 3.7% |
| 5 | 2 | 1.5% |
| 6 | 2 | 1.5% |
| 9 | 1 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 475 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 82 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2664 | |
| Common | 703 | 20.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 312 | 11.7% |
| a | 292 | 11.0% |
| e | 240 | 9.0% |
| i | 203 | 7.6% |
| r | 172 | 6.5% |
| o | 159 | 6.0% |
| l | 134 | 5.0% |
| t | 120 | 4.5% |
| c | 103 | 3.9% |
| n | 102 | 3.8% |
| Other values (38) | 827 |
Common
| Value | Count | Frequency (%) |
| 475 | ||
| - | 82 | 11.7% |
| 4 | 40 | 5.7% |
| 7 | 26 | 3.7% |
| 1 | 23 | 3.3% |
| 2 | 22 | 3.1% |
| 3 | 8 | 1.1% |
| 8 | 5 | 0.7% |
| 0 | 5 | 0.7% |
| ) | 4 | 0.6% |
| Other values (5) | 13 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3247 | |
| None | 120 | 3.6% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 475 | ||
| s | 312 | 9.6% |
| a | 292 | 9.0% |
| e | 240 | 7.4% |
| i | 203 | 6.3% |
| r | 172 | 5.3% |
| o | 159 | 4.9% |
| l | 134 | 4.1% |
| t | 120 | 3.7% |
| c | 103 | 3.2% |
| Other values (45) | 1037 |
None
| Value | Count | Frequency (%) |
| á | 29 | |
| ç | 23 | |
| õ | 21 | |
| â | 18 | |
| ã | 11 | 9.2% |
| Ã | 11 | 9.2% |
| ô | 6 | 5.0% |
| ó | 1 | 0.8% |
metastase_ao_diagnostico_cid_o_4_2
Categorical
MISSING  UNIFORM 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4270 |
| Missing (%) | > 99.9% |
| Memory size | 33.5 KiB |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
|---|---|
| C48 - Tecidos Moles do Retroperitônio e do Peritônio |
Length
| Max length | 52 |
|---|---|
| Median length | 50 |
| Mean length | 50 |
| Min length | 48 |
Characters and Unicode
| Total characters | 100 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
|---|---|
| 2nd row | C48 - Tecidos Moles do Retroperitônio e do Peritônio |
Common Values
| Value | Count | Frequency (%) |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 1 | < 0.1% |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 1 | < 0.1% |
| (Missing) | 4270 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | ||
| e | 2 | |
| do | 2 | |
| c22 | 1 | 5.9% |
| fÃgado | 1 | 5.9% |
| das | 1 | 5.9% |
| vias | 1 | 5.9% |
| biliares | 1 | 5.9% |
| intra-hepáticas | 1 | 5.9% |
| c48 | 1 | 5.9% |
| Other values (4) | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| 15 | ||
| e | 9 | 9.0% |
| i | 9 | 9.0% |
| o | 8 | 8.0% |
| a | 6 | 6.0% |
| s | 6 | 6.0% |
| t | 5 | 5.0% |
| r | 5 | 5.0% |
| d | 4 | 4.0% |
| - | 3 | 3.0% |
| Other values (22) | 30 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 67 | |
| Space Separator | 15 | 15.0% |
| Uppercase Letter | 11 | 11.0% |
| Decimal Number | 4 | 4.0% |
| Dash Punctuation | 3 | 3.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9 | |
| i | 9 | |
| o | 8 | |
| a | 6 | |
| s | 6 | |
| t | 5 | |
| r | 5 | |
| d | 4 | 6.0% |
| n | 3 | 4.5% |
| ô | 2 | 3.0% |
| Other values (7) | 10 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| B | 1 | |
| I | 1 | |
| V | 1 | |
| D | 1 | |
| T | 1 | |
| M | 1 | |
| R | 1 | |
| F | 1 | |
| P | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 4 | 1 | |
| 8 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 15 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 78 | |
| Common | 22 | 22.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 9 | |
| i | 9 | |
| o | 8 | 10.3% |
| a | 6 | 7.7% |
| s | 6 | 7.7% |
| t | 5 | 6.4% |
| r | 5 | 6.4% |
| d | 4 | 5.1% |
| n | 3 | 3.8% |
| C | 2 | 2.6% |
| Other values (17) | 21 |
Common
| Value | Count | Frequency (%) |
| 15 | ||
| - | 3 | 13.6% |
| 2 | 2 | 9.1% |
| 4 | 1 | 4.5% |
| 8 | 1 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 96 | |
| None | 4 | 4.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 15 | ||
| e | 9 | 9.4% |
| i | 9 | 9.4% |
| o | 8 | 8.3% |
| a | 6 | 6.2% |
| s | 6 | 6.2% |
| t | 5 | 5.2% |
| r | 5 | 5.2% |
| d | 4 | 4.2% |
| - | 3 | 3.1% |
| Other values (19) | 26 |
None
| Value | Count | Frequency (%) |
| ô | 2 | |
| á | 1 | |
| Ã | 1 |
data_do_tratamento_1
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 2405 |
|---|---|
| Distinct (%) | 56.7% |
| Missing | 28 |
| Missing (%) | 0.7% |
| Memory size | 33.5 KiB |
| 2015-01-08 | 7 |
|---|---|
| 2015-05-13 | 7 |
| 2017-11-01 | 6 |
| 2016-01-04 | 6 |
| 2016-07-05 | 6 |
| Other values (2400) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 42440 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1279 ? |
|---|---|
| Unique (%) | 30.1% |
Sample
| 1st row | 2008-08-15 |
|---|---|
| 2nd row | 2008-05-29 |
| 3rd row | 2008-04-07 |
| 4th row | 2008-09-29 |
| 5th row | 2008-09-16 |
Common Values
| Value | Count | Frequency (%) |
| 2015-01-08 | 7 | 0.2% |
| 2015-05-13 | 7 | 0.2% |
| 2017-11-01 | 6 | 0.1% |
| 2016-01-04 | 6 | 0.1% |
| 2016-07-05 | 6 | 0.1% |
| 2017-07-29 | 6 | 0.1% |
| 2015-11-12 | 6 | 0.1% |
| 2017-03-06 | 6 | 0.1% |
| 2015-09-13 | 6 | 0.1% |
| 2017-12-17 | 6 | 0.1% |
| Other values (2395) | 4182 | |
| (Missing) | 28 | 0.7% |
Length
| Value | Count | Frequency (%) |
| 2015-01-08 | 7 | 0.2% |
| 2015-05-13 | 7 | 0.2% |
| 2015-09-13 | 6 | 0.1% |
| 2016-10-31 | 6 | 0.1% |
| 2017-07-20 | 6 | 0.1% |
| 2017-04-11 | 6 | 0.1% |
| 2017-12-17 | 6 | 0.1% |
| 2016-05-31 | 6 | 0.1% |
| 2017-03-06 | 6 | 0.1% |
| 2015-11-12 | 6 | 0.1% |
| Other values (2395) | 4182 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9859 | |
| - | 8488 | |
| 1 | 8134 | |
| 2 | 7279 | |
| 6 | 1458 | 3.4% |
| 7 | 1407 | 3.3% |
| 3 | 1396 | 3.3% |
| 5 | 1373 | 3.2% |
| 8 | 1080 | 2.5% |
| 4 | 1079 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 33952 | |
| Dash Punctuation | 8488 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9859 | |
| 1 | 8134 | |
| 2 | 7279 | |
| 6 | 1458 | 4.3% |
| 7 | 1407 | 4.1% |
| 3 | 1396 | 4.1% |
| 5 | 1373 | 4.0% |
| 8 | 1080 | 3.2% |
| 4 | 1079 | 3.2% |
| 9 | 887 | 2.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8488 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42440 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9859 | |
| - | 8488 | |
| 1 | 8134 | |
| 2 | 7279 | |
| 6 | 1458 | 3.4% |
| 7 | 1407 | 3.3% |
| 3 | 1396 | 3.3% |
| 5 | 1373 | 3.2% |
| 8 | 1080 | 2.5% |
| 4 | 1079 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42440 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9859 | |
| - | 8488 | |
| 1 | 8134 | |
| 2 | 7279 | |
| 6 | 1458 | 3.4% |
| 7 | 1407 | 3.3% |
| 3 | 1396 | 3.3% |
| 5 | 1373 | 3.2% |
| 8 | 1080 | 2.5% |
| 4 | 1079 | 2.5% |
data_do_tratamento_2
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 322 |
|---|---|
| Distinct (%) | 93.6% |
| Missing | 3928 |
| Missing (%) | 91.9% |
| Memory size | 33.5 KiB |
| 2017-10-25 | 3 |
|---|---|
| 2016-05-14 | 3 |
| 2017-10-23 | 2 |
| 2013-12-26 | 2 |
| 2012-12-05 | 2 |
| Other values (317) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 3440 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 302 ? |
|---|---|
| Unique (%) | 87.8% |
Sample
| 1st row | 2014-06-17 |
|---|---|
| 2nd row | 2010-03-22 |
| 3rd row | 2016-08-24 |
| 4th row | 2007-12-06 |
| 5th row | 2011-05-10 |
Common Values
| Value | Count | Frequency (%) |
| 2017-10-25 | 3 | 0.1% |
| 2016-05-14 | 3 | 0.1% |
| 2017-10-23 | 2 | < 0.1% |
| 2013-12-26 | 2 | < 0.1% |
| 2012-12-05 | 2 | < 0.1% |
| 2018-06-03 | 2 | < 0.1% |
| 2013-02-23 | 2 | < 0.1% |
| 2017-09-01 | 2 | < 0.1% |
| 2017-06-10 | 2 | < 0.1% |
| 2013-08-15 | 2 | < 0.1% |
| Other values (312) | 322 | 7.5% |
| (Missing) | 3928 |
Length
| Value | Count | Frequency (%) |
| 2017-10-25 | 3 | 0.9% |
| 2016-05-14 | 3 | 0.9% |
| 2017-01-12 | 2 | 0.6% |
| 2014-08-07 | 2 | 0.6% |
| 2017-12-09 | 2 | 0.6% |
| 2014-09-25 | 2 | 0.6% |
| 2015-06-13 | 2 | 0.6% |
| 2016-03-21 | 2 | 0.6% |
| 2015-06-16 | 2 | 0.6% |
| 2016-08-31 | 2 | 0.6% |
| Other values (312) | 322 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 799 | |
| - | 688 | |
| 1 | 635 | |
| 2 | 585 | |
| 7 | 133 | 3.9% |
| 6 | 113 | 3.3% |
| 5 | 106 | 3.1% |
| 3 | 103 | 3.0% |
| 8 | 103 | 3.0% |
| 4 | 88 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2752 | |
| Dash Punctuation | 688 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 799 | |
| 1 | 635 | |
| 2 | 585 | |
| 7 | 133 | 4.8% |
| 6 | 113 | 4.1% |
| 5 | 106 | 3.9% |
| 3 | 103 | 3.7% |
| 8 | 103 | 3.7% |
| 4 | 88 | 3.2% |
| 9 | 87 | 3.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 688 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3440 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 799 | |
| - | 688 | |
| 1 | 635 | |
| 2 | 585 | |
| 7 | 133 | 3.9% |
| 6 | 113 | 3.3% |
| 5 | 106 | 3.1% |
| 3 | 103 | 3.0% |
| 8 | 103 | 3.0% |
| 4 | 88 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3440 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 799 | |
| - | 688 | |
| 1 | 635 | |
| 2 | 585 | |
| 7 | 133 | 3.9% |
| 6 | 113 | 3.3% |
| 5 | 106 | 3.1% |
| 3 | 103 | 3.0% |
| 8 | 103 | 3.0% |
| 4 | 88 | 2.6% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Outras combinações | |
|---|---|
| Cirurgia + Radio + Quimio + Hormonio | |
| Cirurgia + Radio + Quimio | |
| Quimioterapia | |
| Radioterapia + Quimioterapia | |
| Other values (5) |
Length
| Max length | 36 |
|---|---|
| Median length | 28 |
| Mean length | 23.212781 |
| Min length | 8 |
Characters and Unicode
| Total characters | 99165 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Cirurgia + Radio + Quimio + Hormonio |
|---|---|
| 2nd row | Cirurgia + Quimioterapia |
| 3rd row | Outras combinações |
| 4th row | Outras combinações |
| 5th row | Cirurgia + Radio + Quimio |
Common Values
| Value | Count | Frequency (%) |
| Outras combinações | 1807 | |
| Cirurgia + Radio + Quimio + Hormonio | 892 | |
| Cirurgia + Radio + Quimio | 578 | 13.5% |
| Quimioterapia | 335 | 7.8% |
| Radioterapia + Quimioterapia | 333 | 7.8% |
| Cirurgia + Quimioterapia | 165 | 3.9% |
| Cirurgia | 51 | 1.2% |
| Nenhum tratamento | 45 | 1.1% |
| Cirurgia + Radioterapia | 43 | 1.0% |
| Radioterapia | 23 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 4373 | ||
| outras | 1807 | |
| combinações | 1807 | |
| cirurgia | 1729 | 11.6% |
| radio | 1470 | 9.9% |
| quimio | 1470 | 9.9% |
| hormonio | 892 | 6.0% |
| quimioterapia | 833 | 5.6% |
| radioterapia | 399 | 2.7% |
| nenhum | 45 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 13864 | |
| 10598 | 10.7% | |
| a | 9766 | 9.8% |
| o | 8700 | 8.8% |
| r | 7434 | 7.5% |
| u | 5884 | 5.9% |
| m | 5092 | 5.1% |
| + | 4373 | 4.4% |
| s | 3614 | 3.6% |
| t | 3174 | 3.2% |
| Other values (16) | 26666 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 75549 | |
| Space Separator | 10598 | 10.7% |
| Uppercase Letter | 8645 | 8.7% |
| Math Symbol | 4373 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 13864 | |
| a | 9766 | |
| o | 8700 | |
| r | 7434 | |
| u | 5884 | |
| m | 5092 | 6.7% |
| s | 3614 | 4.8% |
| t | 3174 | 4.2% |
| e | 3129 | 4.1% |
| n | 2789 | 3.7% |
| Other values (8) | 12103 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Q | 2303 | |
| R | 1869 | |
| O | 1807 | |
| C | 1729 | |
| H | 892 | 10.3% |
| N | 45 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 10598 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 4373 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 84194 | |
| Common | 14971 | 15.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 13864 | |
| a | 9766 | |
| o | 8700 | |
| r | 7434 | 8.8% |
| u | 5884 | 7.0% |
| m | 5092 | 6.0% |
| s | 3614 | 4.3% |
| t | 3174 | 3.8% |
| e | 3129 | 3.7% |
| n | 2789 | 3.3% |
| Other values (14) | 20748 |
Common
| Value | Count | Frequency (%) |
| 10598 | ||
| + | 4373 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 95551 | |
| None | 3614 | 3.6% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 13864 | |
| 10598 | ||
| a | 9766 | |
| o | 8700 | 9.1% |
| r | 7434 | 7.8% |
| u | 5884 | 6.2% |
| m | 5092 | 5.3% |
| + | 4373 | 4.6% |
| s | 3614 | 3.8% |
| t | 3174 | 3.3% |
| Other values (14) | 23052 |
None
| Value | Count | Frequency (%) |
| ç | 1807 | |
| õ | 1807 |
| Distinct | 10 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| Outras combinações | |
|---|---|
| Cirurgia | |
| Quimioterapia | |
| Cirurgia + Radio + Quimio + Hormonio | |
| Nenhum tratamento | |
| Other values (5) |
Length
| Max length | 36 |
|---|---|
| Median length | 28 |
| Mean length | 16.815718 |
| Min length | 8 |
Characters and Unicode
| Total characters | 6205 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Quimioterapia |
|---|---|
| 2nd row | Cirurgia + Quimioterapia |
| 3rd row | Cirurgia + Radio + Quimio + Hormonio |
| 4th row | Nenhum tratamento |
| 5th row | Cirurgia |
Common Values
| Value | Count | Frequency (%) |
| Outras combinações | 113 | 2.6% |
| Cirurgia | 103 | 2.4% |
| Quimioterapia | 40 | 0.9% |
| Cirurgia + Radio + Quimio + Hormonio | 27 | 0.6% |
| Nenhum tratamento | 27 | 0.6% |
| Cirurgia + Quimioterapia | 19 | 0.4% |
| Cirurgia + Radio + Quimio | 13 | 0.3% |
| Radioterapia + Quimioterapia | 12 | 0.3% |
| Cirurgia + Radioterapia | 9 | 0.2% |
| Radioterapia | 6 | 0.1% |
| (Missing) | 3903 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cirurgia | 171 | |
| 147 | ||
| outras | 113 | |
| combinações | 113 | |
| quimioterapia | 71 | |
| radio | 40 | 5.0% |
| quimio | 40 | 5.0% |
| hormonio | 27 | 3.4% |
| nenhum | 27 | 3.4% |
| tratamento | 27 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 869 | |
| a | 714 | |
| r | 607 | 9.8% |
| 434 | 7.0% | |
| u | 422 | 6.8% |
| o | 399 | 6.4% |
| m | 305 | 4.9% |
| t | 292 | 4.7% |
| e | 265 | 4.3% |
| s | 226 | 3.6% |
| Other values (16) | 1672 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5108 | |
| Uppercase Letter | 516 | 8.3% |
| Space Separator | 434 | 7.0% |
| Math Symbol | 147 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 869 | |
| a | 714 | |
| r | 607 | |
| u | 422 | |
| o | 399 | |
| m | 305 | 6.0% |
| t | 292 | 5.7% |
| e | 265 | 5.2% |
| s | 226 | 4.4% |
| n | 194 | 3.8% |
| Other values (8) | 815 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 171 | |
| O | 113 | |
| Q | 111 | |
| R | 67 | 13.0% |
| H | 27 | 5.2% |
| N | 27 | 5.2% |
Space Separator
| Value | Count | Frequency (%) |
| 434 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 147 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5624 | |
| Common | 581 | 9.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 869 | |
| a | 714 | |
| r | 607 | |
| u | 422 | 7.5% |
| o | 399 | 7.1% |
| m | 305 | 5.4% |
| t | 292 | 5.2% |
| e | 265 | 4.7% |
| s | 226 | 4.0% |
| n | 194 | 3.4% |
| Other values (14) | 1331 |
Common
| Value | Count | Frequency (%) |
| 434 | ||
| + | 147 | 25.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5979 | |
| None | 226 | 3.6% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 869 | |
| a | 714 | |
| r | 607 | |
| 434 | 7.3% | |
| u | 422 | 7.1% |
| o | 399 | 6.7% |
| m | 305 | 5.1% |
| t | 292 | 4.9% |
| e | 265 | 4.4% |
| s | 226 | 3.8% |
| Other values (14) | 1446 |
None
| Value | Count | Frequency (%) |
| ç | 113 | |
| õ | 113 |
ano_do_diagnostico_1
Real number (ℝ)
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2014.3897 |
| Minimum | 2008 |
|---|---|
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 2008 |
|---|---|
| 5-th percentile | 2010 |
| Q1 | 2012 |
| median | 2015 |
| Q3 | 2017 |
| 95-th percentile | 2018 |
| Maximum | 2020 |
| Range | 12 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.6955947 |
|---|---|
| Coefficient of variation (CV) | 0.0013381694 |
| Kurtosis | -0.7535391 |
| Mean | 2014.3897 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.1738211 |
| Sum | 8605473 |
| Variance | 7.2662306 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2017 | 646 | |
| 2016 | 645 | |
| 2015 | 602 | |
| 2011 | 481 | |
| 2013 | 426 | |
| 2012 | 410 | |
| 2014 | 314 | |
| 2018 | 264 | |
| 2010 | 193 | 4.5% |
| 2020 | 115 | 2.7% |
| Other values (3) | 176 | 4.1% |
| Value | Count | Frequency (%) |
| 2008 | 40 | 0.9% |
| 2009 | 89 | 2.1% |
| 2010 | 193 | 4.5% |
| 2011 | 481 | |
| 2012 | 410 | |
| 2013 | 426 | |
| 2014 | 314 | |
| 2015 | 602 | |
| 2016 | 645 | |
| 2017 | 646 |
| Value | Count | Frequency (%) |
| 2020 | 115 | 2.7% |
| 2019 | 47 | 1.1% |
| 2018 | 264 | |
| 2017 | 646 | |
| 2016 | 645 | |
| 2015 | 602 | |
| 2014 | 314 | |
| 2013 | 426 | |
| 2012 | 410 | |
| 2011 | 481 |
ano_do_diagnostico_2
Real number (ℝ)
| Distinct | 13 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2015.8293 |
| Minimum | 2008 |
|---|---|
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 2008 |
|---|---|
| 5-th percentile | 2011 |
| Q1 | 2014 |
| median | 2016 |
| Q3 | 2018 |
| 95-th percentile | 2020 |
| Maximum | 2020 |
| Range | 12 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.5687524 |
|---|---|
| Coefficient of variation (CV) | 0.0012742907 |
| Kurtosis | -0.021503999 |
| Mean | 2015.8293 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.61985561 |
| Sum | 743841 |
| Variance | 6.5984889 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2017 | 70 | 1.6% |
| 2018 | 56 | 1.3% |
| 2016 | 54 | 1.3% |
| 2015 | 36 | 0.8% |
| 2014 | 35 | 0.8% |
| 2013 | 31 | 0.7% |
| 2019 | 27 | 0.6% |
| 2020 | 20 | 0.5% |
| 2012 | 14 | 0.3% |
| 2011 | 14 | 0.3% |
| Other values (3) | 12 | 0.3% |
| (Missing) | 3903 |
| Value | Count | Frequency (%) |
| 2008 | 2 | < 0.1% |
| 2009 | 5 | 0.1% |
| 2010 | 5 | 0.1% |
| 2011 | 14 | 0.3% |
| 2012 | 14 | 0.3% |
| 2013 | 31 | |
| 2014 | 35 | |
| 2015 | 36 | |
| 2016 | 54 | |
| 2017 | 70 |
| Value | Count | Frequency (%) |
| 2020 | 20 | 0.5% |
| 2019 | 27 | 0.6% |
| 2018 | 56 | |
| 2017 | 70 | |
| 2016 | 54 | |
| 2015 | 36 | |
| 2014 | 35 | |
| 2013 | 31 | |
| 2012 | 14 | 0.3% |
| 2011 | 14 | 0.3% |
lateralidade_do_tumor_1
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Esquerda | |
|---|---|
| Direita | |
| não se aplica | 126 |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 7.6872659 |
| Min length | 7 |
Characters and Unicode
| Total characters | 32840 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Esquerda |
|---|---|
| 2nd row | Esquerda |
| 3rd row | Esquerda |
| 4th row | Esquerda |
| 5th row | Direita |
Common Values
| Value | Count | Frequency (%) |
| Esquerda | 2180 | |
| Direita | 1966 | |
| não se aplica | 126 | 2.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| esquerda | 2180 | |
| direita | 1966 | |
| não | 126 | 2.8% |
| se | 126 | 2.8% |
| aplica | 126 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4398 | |
| e | 4272 | |
| r | 4146 | |
| i | 4058 | |
| s | 2306 | |
| d | 2180 | |
| E | 2180 | |
| u | 2180 | |
| q | 2180 | |
| D | 1966 | |
| Other values (8) | 2974 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28442 | |
| Uppercase Letter | 4146 | 12.6% |
| Space Separator | 252 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4398 | |
| e | 4272 | |
| r | 4146 | |
| i | 4058 | |
| s | 2306 | |
| d | 2180 | |
| u | 2180 | |
| q | 2180 | |
| t | 1966 | |
| n | 126 | 0.4% |
| Other values (5) | 630 | 2.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2180 | |
| D | 1966 |
Space Separator
| Value | Count | Frequency (%) |
| 252 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32588 | |
| Common | 252 | 0.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4398 | |
| e | 4272 | |
| r | 4146 | |
| i | 4058 | |
| s | 2306 | |
| d | 2180 | |
| E | 2180 | |
| u | 2180 | |
| q | 2180 | |
| D | 1966 | |
| Other values (7) | 2722 |
Common
| Value | Count | Frequency (%) |
| 252 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32714 | |
| None | 126 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4398 | |
| e | 4272 | |
| r | 4146 | |
| i | 4058 | |
| s | 2306 | |
| d | 2180 | |
| E | 2180 | |
| u | 2180 | |
| q | 2180 | |
| D | 1966 | |
| Other values (7) | 2848 |
None
| Value | Count | Frequency (%) |
| ã | 126 |
lateralidade_do_tumor_2
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| não se aplica | |
|---|---|
| Esquerda | |
| Direita | |
| Bilateral | 2 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 9.6585366 |
| Min length | 7 |
Characters and Unicode
| Total characters | 3564 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | não se aplica |
|---|---|
| 2nd row | não se aplica |
| 3rd row | Direita |
| 4th row | não se aplica |
| 5th row | não se aplica |
Common Values
| Value | Count | Frequency (%) |
| não se aplica | 142 | 3.3% |
| Esquerda | 125 | 2.9% |
| Direita | 100 | 2.3% |
| Bilateral | 2 | < 0.1% |
| (Missing) | 3903 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| não | 142 | |
| se | 142 | |
| aplica | 142 | |
| esquerda | 125 | |
| direita | 100 | |
| bilateral | 2 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 513 | |
| e | 369 | 10.4% |
| i | 344 | 9.7% |
| 284 | 8.0% | |
| s | 267 | 7.5% |
| r | 227 | 6.4% |
| l | 146 | 4.1% |
| ã | 142 | 4.0% |
| c | 142 | 4.0% |
| n | 142 | 4.0% |
| Other values (9) | 988 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3053 | |
| Space Separator | 284 | 8.0% |
| Uppercase Letter | 227 | 6.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 513 | |
| e | 369 | |
| i | 344 | |
| s | 267 | |
| r | 227 | 7.4% |
| l | 146 | 4.8% |
| ã | 142 | 4.7% |
| c | 142 | 4.7% |
| n | 142 | 4.7% |
| p | 142 | 4.7% |
| Other values (5) | 619 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 125 | |
| D | 100 | |
| B | 2 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 284 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3280 | |
| Common | 284 | 8.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 513 | |
| e | 369 | |
| i | 344 | |
| s | 267 | 8.1% |
| r | 227 | 6.9% |
| l | 146 | 4.5% |
| ã | 142 | 4.3% |
| c | 142 | 4.3% |
| n | 142 | 4.3% |
| p | 142 | 4.3% |
| Other values (8) | 846 |
Common
| Value | Count | Frequency (%) |
| 284 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3422 | |
| None | 142 | 4.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 513 | |
| e | 369 | |
| i | 344 | |
| 284 | 8.3% | |
| s | 267 | 7.8% |
| r | 227 | 6.6% |
| l | 146 | 4.3% |
| c | 142 | 4.1% |
| n | 142 | 4.1% |
| p | 142 | 4.1% |
| Other values (8) | 846 |
None
| Value | Count | Frequency (%) |
| ã | 142 |
data_de_recidiva_1
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 1021 |
|---|---|
| Distinct (%) | 81.7% |
| Missing | 3023 |
| Missing (%) | 70.8% |
| Memory size | 33.5 KiB |
| 2017-07-09 | 4 |
|---|---|
| 2016-08-08 | 4 |
| 2018-09-03 | 4 |
| 2014-11-06 | 4 |
| 2018-11-16 | 4 |
| Other values (1016) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 12490 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 835 ? |
|---|---|
| Unique (%) | 66.9% |
Sample
| 1st row | 2014-07-19 |
|---|---|
| 2nd row | 2010-07-15 |
| 3rd row | 2012-12-19 |
| 4th row | 2016-02-29 |
| 5th row | 2009-08-14 |
Common Values
| Value | Count | Frequency (%) |
| 2017-07-09 | 4 | 0.1% |
| 2016-08-08 | 4 | 0.1% |
| 2018-09-03 | 4 | 0.1% |
| 2014-11-06 | 4 | 0.1% |
| 2018-11-16 | 4 | 0.1% |
| 2017-11-18 | 4 | 0.1% |
| 2014-08-21 | 3 | 0.1% |
| 2017-10-17 | 3 | 0.1% |
| 2017-11-21 | 3 | 0.1% |
| 2018-08-24 | 3 | 0.1% |
| Other values (1011) | 1213 | |
| (Missing) | 3023 |
Length
| Value | Count | Frequency (%) |
| 2017-07-09 | 4 | 0.3% |
| 2018-11-16 | 4 | 0.3% |
| 2017-11-18 | 4 | 0.3% |
| 2016-08-08 | 4 | 0.3% |
| 2014-11-06 | 4 | 0.3% |
| 2018-09-03 | 4 | 0.3% |
| 2017-10-26 | 3 | 0.2% |
| 2017-08-20 | 3 | 0.2% |
| 2014-10-04 | 3 | 0.2% |
| 2019-01-30 | 3 | 0.2% |
| Other values (1011) | 1213 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2813 | |
| - | 2498 | |
| 1 | 2336 | |
| 2 | 2077 | |
| 7 | 473 | 3.8% |
| 8 | 461 | 3.7% |
| 3 | 402 | 3.2% |
| 6 | 389 | 3.1% |
| 5 | 374 | 3.0% |
| 9 | 336 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9992 | |
| Dash Punctuation | 2498 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2813 | |
| 1 | 2336 | |
| 2 | 2077 | |
| 7 | 473 | 4.7% |
| 8 | 461 | 4.6% |
| 3 | 402 | 4.0% |
| 6 | 389 | 3.9% |
| 5 | 374 | 3.7% |
| 9 | 336 | 3.4% |
| 4 | 331 | 3.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2498 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12490 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2813 | |
| - | 2498 | |
| 1 | 2336 | |
| 2 | 2077 | |
| 7 | 473 | 3.8% |
| 8 | 461 | 3.7% |
| 3 | 402 | 3.2% |
| 6 | 389 | 3.1% |
| 5 | 374 | 3.0% |
| 9 | 336 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12490 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2813 | |
| - | 2498 | |
| 1 | 2336 | |
| 2 | 2077 | |
| 7 | 473 | 3.8% |
| 8 | 461 | 3.7% |
| 3 | 402 | 3.2% |
| 6 | 389 | 3.1% |
| 5 | 374 | 3.0% |
| 9 | 336 | 2.7% |
data_de_recidiva_2
Categorical
MISSING  UNIFORM 
| Distinct | 45 |
|---|---|
| Distinct (%) | 97.8% |
| Missing | 4226 |
| Missing (%) | 98.9% |
| Memory size | 33.5 KiB |
| 2019-02-16 | 2 |
|---|---|
| 2017-10-26 | 1 |
| 2019-02-03 | 1 |
| 2019-02-09 | 1 |
| 2018-04-16 | 1 |
| Other values (40) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 460 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 44 ? |
|---|---|
| Unique (%) | 95.7% |
Sample
| 1st row | 2011-08-04 |
|---|---|
| 2nd row | 2012-04-24 |
| 3rd row | 2017-10-11 |
| 4th row | 2017-03-24 |
| 5th row | 2018-11-24 |
Common Values
| Value | Count | Frequency (%) |
| 2019-02-16 | 2 | < 0.1% |
| 2017-10-26 | 1 | < 0.1% |
| 2019-02-03 | 1 | < 0.1% |
| 2019-02-09 | 1 | < 0.1% |
| 2018-04-16 | 1 | < 0.1% |
| 2019-05-06 | 1 | < 0.1% |
| 2017-07-27 | 1 | < 0.1% |
| 2017-03-14 | 1 | < 0.1% |
| 2020-11-05 | 1 | < 0.1% |
| 2015-10-24 | 1 | < 0.1% |
| Other values (35) | 35 | 0.8% |
| (Missing) | 4226 |
Length
| Value | Count | Frequency (%) |
| 2019-02-16 | 2 | 4.3% |
| 2015-10-23 | 1 | 2.2% |
| 2012-04-24 | 1 | 2.2% |
| 2017-10-11 | 1 | 2.2% |
| 2017-03-24 | 1 | 2.2% |
| 2018-11-24 | 1 | 2.2% |
| 2012-09-07 | 1 | 2.2% |
| 2012-03-22 | 1 | 2.2% |
| 2015-04-27 | 1 | 2.2% |
| 2011-04-04 | 1 | 2.2% |
| Other values (35) | 35 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 104 | |
| - | 92 | |
| 1 | 82 | |
| 2 | 81 | |
| 7 | 20 | 4.3% |
| 9 | 18 | 3.9% |
| 5 | 16 | 3.5% |
| 4 | 15 | 3.3% |
| 6 | 14 | 3.0% |
| 3 | 10 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 368 | |
| Dash Punctuation | 92 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 104 | |
| 1 | 82 | |
| 2 | 81 | |
| 7 | 20 | 5.4% |
| 9 | 18 | 4.9% |
| 5 | 16 | 4.3% |
| 4 | 15 | 4.1% |
| 6 | 14 | 3.8% |
| 3 | 10 | 2.7% |
| 8 | 8 | 2.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 92 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 460 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 104 | |
| - | 92 | |
| 1 | 82 | |
| 2 | 81 | |
| 7 | 20 | 4.3% |
| 9 | 18 | 3.9% |
| 5 | 16 | 3.5% |
| 4 | 15 | 3.3% |
| 6 | 14 | 3.0% |
| 3 | 10 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 460 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 104 | |
| - | 92 | |
| 1 | 82 | |
| 2 | 81 | |
| 7 | 20 | 4.3% |
| 9 | 18 | 3.9% |
| 5 | 16 | 3.5% |
| 4 | 15 | 3.3% |
| 6 | 14 | 3.0% |
| 3 | 10 | 2.2% |
tempo_desde_o_diagnostico_ate_a_primeira_recidiv_1
Real number (ℝ)
| Distinct | 821 |
|---|---|
| Distinct (%) | 65.7% |
| Missing | 3023 |
| Missing (%) | 70.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 633.98559 |
| Minimum | 0 |
|---|---|
| Maximum | 3462 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 68 |
| Q1 | 256 |
| median | 489 |
| Q3 | 868 |
| 95-th percentile | 1708.2 |
| Maximum | 3462 |
| Range | 3462 |
| Interquartile range (IQR) | 612 |
Descriptive statistics
| Standard deviation | 535.467 |
|---|---|
| Coefficient of variation (CV) | 0.84460438 |
| Kurtosis | 4.0981657 |
| Mean | 633.98559 |
| Median Absolute Deviation (MAD) | 281 |
| Skewness | 1.7732261 |
| Sum | 791848 |
| Variance | 286724.91 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 276 | 6 | 0.1% |
| 188 | 5 | 0.1% |
| 777 | 5 | 0.1% |
| 251 | 5 | 0.1% |
| 309 | 5 | 0.1% |
| 195 | 5 | 0.1% |
| 217 | 4 | 0.1% |
| 719 | 4 | 0.1% |
| 345 | 4 | 0.1% |
| 248 | 4 | 0.1% |
| Other values (811) | 1202 | 28.1% |
| (Missing) | 3023 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 7 | 2 | |
| 8 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 12 | 3 | |
| 18 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3462 | 1 | |
| 3283 | 1 | |
| 3091 | 1 | |
| 3089 | 1 | |
| 3069 | 1 | |
| 2933 | 1 | |
| 2870 | 1 | |
| 2850 | 1 | |
| 2739 | 2 | |
| 2729 | 1 |
tempo_desde_o_diagnostico_ate_a_primeira_recidiv_2
Real number (ℝ)
| Distinct | 45 |
|---|---|
| Distinct (%) | 97.8% |
| Missing | 4226 |
| Missing (%) | 98.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 614.97826 |
| Minimum | 0 |
|---|---|
| Maximum | 2977 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 56.25 |
| Q1 | 231.25 |
| median | 467 |
| Q3 | 861.5 |
| 95-th percentile | 1590.75 |
| Maximum | 2977 |
| Range | 2977 |
| Interquartile range (IQR) | 630.25 |
Descriptive statistics
| Standard deviation | 560.80032 |
|---|---|
| Coefficient of variation (CV) | 0.91190267 |
| Kurtosis | 5.9611467 |
| Mean | 614.97826 |
| Median Absolute Deviation (MAD) | 271.5 |
| Skewness | 2.025504 |
| Sum | 28289 |
| Variance | 314497 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 263 | 2 | < 0.1% |
| 676 | 1 | < 0.1% |
| 244 | 1 | < 0.1% |
| 474 | 1 | < 0.1% |
| 1054 | 1 | < 0.1% |
| 353 | 1 | < 0.1% |
| 1560 | 1 | < 0.1% |
| 1696 | 1 | < 0.1% |
| 415 | 1 | < 0.1% |
| 407 | 1 | < 0.1% |
| Other values (35) | 35 | 0.8% |
| (Missing) | 4226 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 24 | 1 | |
| 49 | 1 | |
| 78 | 1 | |
| 115 | 1 | |
| 135 | 1 | |
| 155 | 1 | |
| 158 | 1 | |
| 194 | 1 | |
| 199 | 1 |
| Value | Count | Frequency (%) |
| 2977 | 1 | |
| 1696 | 1 | |
| 1601 | 1 | |
| 1560 | 1 | |
| 1457 | 1 | |
| 1054 | 1 | |
| 1044 | 1 | |
| 996 | 1 | |
| 906 | 1 | |
| 900 | 1 |
| Distinct | 23 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 3282 |
| Missing (%) | 76.8% |
| Memory size | 33.5 KiB |
| C34 - Bronquios e Pulmoes | |
|---|---|
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
| C71 - Encefalo | |
| C40 - Ossos e Cartilagens Articulares Dos Membros | |
| Other values (18) |
Length
| Max length | 100 |
|---|---|
| Median length | 59 |
| Mean length | 39.748485 |
| Min length | 10 |
Characters and Unicode
| Total characters | 39351 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | C34 - Bronquios e Pulmoes |
|---|---|
| 2nd row | C38 - Coração, Mediastino e Pleura, |
| 3rd row | C71 - Encefalo |
| 4th row | C71 - Encefalo |
| 5th row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
Common Values
| Value | Count | Frequency (%) |
| C34 - Bronquios e Pulmoes | 258 | 6.0% |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 169 | 4.0% |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 142 | 3.3% |
| C71 - Encefalo | 131 | 3.1% |
| C40 - Ossos e Cartilagens Articulares Dos Membros | 87 | 2.0% |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 86 | 2.0% |
| C38 - Coração, Mediastino e Pleura, | 36 | 0.8% |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 21 | 0.5% |
| C50 - Mama | 15 | 0.4% |
| C49 - Tecido Conjuntivo e de Outros Tecidos Moles | 13 | 0.3% |
| Other values (13) | 32 | 0.7% |
| (Missing) | 3282 |
Length
| Value | Count | Frequency (%) |
| 990 | 14.7% | |
| e | 818 | 12.2% |
| das | 311 | 4.6% |
| pulmoes | 258 | 3.8% |
| c34 | 258 | 3.8% |
| bronquios | 258 | 3.8% |
| ossos | 256 | 3.8% |
| cartilagens | 256 | 3.8% |
| articulares | 256 | 3.8% |
| de | 186 | 2.8% |
| Other values (78) | 2870 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5727 | ||
| s | 3551 | 9.0% |
| a | 2910 | 7.4% |
| e | 2906 | 7.4% |
| o | 2385 | 6.1% |
| i | 2278 | 5.8% |
| r | 1839 | 4.7% |
| l | 1413 | 3.6% |
| C | 1304 | 3.3% |
| t | 1208 | 3.1% |
| Other values (54) | 13830 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25762 | |
| Space Separator | 5727 | 14.6% |
| Uppercase Letter | 4663 | 11.8% |
| Decimal Number | 1980 | 5.0% |
| Dash Punctuation | 1139 | 2.9% |
| Other Punctuation | 78 | 0.2% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 3551 | |
| a | 2910 | |
| e | 2906 | |
| o | 2385 | |
| i | 2278 | |
| r | 1839 | 7.1% |
| l | 1413 | 5.5% |
| t | 1208 | 4.7% |
| n | 1201 | 4.7% |
| u | 1103 | 4.3% |
| Other values (21) | 4968 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1304 | |
| D | 488 | 10.5% |
| O | 447 | 9.6% |
| B | 401 | 8.6% |
| P | 327 | 7.0% |
| A | 257 | 5.5% |
| L | 255 | 5.5% |
| E | 226 | 4.8% |
| M | 185 | 4.0% |
| V | 143 | 3.1% |
| Other values (8) | 630 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 564 | |
| 7 | 313 | |
| 1 | 306 | |
| 3 | 295 | |
| 2 | 292 | |
| 0 | 108 | 5.5% |
| 8 | 58 | 2.9% |
| 5 | 20 | 1.0% |
| 9 | 13 | 0.7% |
| 6 | 11 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 5727 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1139 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 78 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30425 | |
| Common | 8926 | 22.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 3551 | 11.7% |
| a | 2910 | 9.6% |
| e | 2906 | 9.6% |
| o | 2385 | 7.8% |
| i | 2278 | 7.5% |
| r | 1839 | 6.0% |
| l | 1413 | 4.6% |
| C | 1304 | 4.3% |
| t | 1208 | 4.0% |
| n | 1201 | 3.9% |
| Other values (39) | 9430 |
Common
| Value | Count | Frequency (%) |
| 5727 | ||
| - | 1139 | 12.8% |
| 4 | 564 | 6.3% |
| 7 | 313 | 3.5% |
| 1 | 306 | 3.4% |
| 3 | 295 | 3.3% |
| 2 | 292 | 3.3% |
| 0 | 108 | 1.2% |
| , | 78 | 0.9% |
| 8 | 58 | 0.6% |
| Other values (5) | 46 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38266 | |
| None | 1085 | 2.8% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5727 | ||
| s | 3551 | 9.3% |
| a | 2910 | 7.6% |
| e | 2906 | 7.6% |
| o | 2385 | 6.2% |
| i | 2278 | 6.0% |
| r | 1839 | 4.8% |
| l | 1413 | 3.7% |
| C | 1304 | 3.4% |
| t | 1208 | 3.2% |
| Other values (46) | 12745 |
None
| Value | Count | Frequency (%) |
| á | 314 | |
| ç | 205 | |
| õ | 169 | |
| Ã | 143 | |
| ã | 122 | 11.2% |
| â | 88 | 8.1% |
| ô | 42 | 3.9% |
| é | 2 | 0.2% |
| Distinct | 11 |
|---|---|
| Distinct (%) | 26.8% |
| Missing | 4231 |
| Missing (%) | 99.0% |
| Memory size | 33.5 KiB |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
|---|---|
| C34 - Bronquios e Pulmoes | |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | |
| Other values (6) |
Length
| Max length | 64 |
|---|---|
| Median length | 52 |
| Mean length | 42 |
| Min length | 10 |
Characters and Unicode
| Total characters | 1722 |
|---|---|
| Distinct characters | 60 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 7.3% |
Sample
| 1st row | C41 - Ossos e Das Cartilagens Articulares de Outras Localizações |
|---|---|
| 2nd row | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
| 3rd row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
| 4th row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
| 5th row | C34 - Bronquios e Pulmoes |
Common Values
| Value | Count | Frequency (%) |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 10 | 0.2% |
| C34 - Bronquios e Pulmoes | 9 | 0.2% |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 5 | 0.1% |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 4 | 0.1% |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 3 | 0.1% |
| C71 - Encefalo | 3 | 0.1% |
| C49 - Tecido Conjuntivo e de Outros Tecidos Moles | 2 | < 0.1% |
| C40 - Ossos e Cartilagens Articulares Dos Membros | 2 | < 0.1% |
| C74 - Glândula Supra-renal (Glândula Adrenal) | 1 | < 0.1% |
| C56 - Ovario | 1 | < 0.1% |
| (Missing) | 4231 |
Length
| Value | Count | Frequency (%) |
| 41 | 14.0% | |
| e | 35 | 11.9% |
| das | 15 | 5.1% |
| c22 | 10 | 3.4% |
| fÃgado | 10 | 3.4% |
| vias | 10 | 3.4% |
| biliares | 10 | 3.4% |
| intra-hepáticas | 10 | 3.4% |
| c34 | 9 | 3.1% |
| bronquios | 9 | 3.1% |
| Other values (37) | 134 |
Most occurring characters
| Value | Count | Frequency (%) |
| 252 | ||
| s | 140 | 8.1% |
| e | 127 | 7.4% |
| a | 122 | 7.1% |
| i | 114 | 6.6% |
| o | 107 | 6.2% |
| r | 76 | 4.4% |
| t | 57 | 3.3% |
| l | 56 | 3.3% |
| n | 55 | 3.2% |
| Other values (50) | 616 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1130 | |
| Space Separator | 252 | 14.6% |
| Uppercase Letter | 204 | 11.8% |
| Decimal Number | 82 | 4.8% |
| Dash Punctuation | 52 | 3.0% |
| Open Punctuation | 1 | 0.1% |
| Close Punctuation | 1 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 140 | |
| e | 127 | |
| a | 122 | |
| i | 114 | |
| o | 107 | |
| r | 76 | 6.7% |
| t | 57 | 5.0% |
| l | 56 | 5.0% |
| n | 55 | 4.9% |
| c | 48 | 4.2% |
| Other values (19) | 228 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 50 | |
| D | 21 | |
| B | 19 | 9.3% |
| O | 15 | 7.4% |
| P | 12 | 5.9% |
| F | 10 | 4.9% |
| I | 10 | 4.9% |
| V | 10 | 4.9% |
| L | 9 | 4.4% |
| A | 8 | 3.9% |
| Other values (7) | 40 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 22 | |
| 2 | 21 | |
| 7 | 12 | |
| 3 | 9 | |
| 1 | 8 | 9.8% |
| 8 | 3 | 3.7% |
| 0 | 3 | 3.7% |
| 9 | 2 | 2.4% |
| 5 | 1 | 1.2% |
| 6 | 1 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 252 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 52 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1334 | |
| Common | 388 | 22.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 140 | 10.5% |
| e | 127 | 9.5% |
| a | 122 | 9.1% |
| i | 114 | 8.5% |
| o | 107 | 8.0% |
| r | 76 | 5.7% |
| t | 57 | 4.3% |
| l | 56 | 4.2% |
| n | 55 | 4.1% |
| C | 50 | 3.7% |
| Other values (36) | 430 |
Common
| Value | Count | Frequency (%) |
| 252 | ||
| - | 52 | 13.4% |
| 4 | 22 | 5.7% |
| 2 | 21 | 5.4% |
| 7 | 12 | 3.1% |
| 3 | 9 | 2.3% |
| 1 | 8 | 2.1% |
| 8 | 3 | 0.8% |
| 0 | 3 | 0.8% |
| 9 | 2 | 0.5% |
| Other values (4) | 4 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1668 | |
| None | 54 | 3.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 252 | ||
| s | 140 | 8.4% |
| e | 127 | 7.6% |
| a | 122 | 7.3% |
| i | 114 | 6.8% |
| o | 107 | 6.4% |
| r | 76 | 4.6% |
| t | 57 | 3.4% |
| l | 56 | 3.4% |
| n | 55 | 3.3% |
| Other values (43) | 562 |
None
| Value | Count | Frequency (%) |
| á | 18 | |
| Ã | 10 | |
| ô | 6 | 11.1% |
| â | 6 | 11.1% |
| ç | 5 | 9.3% |
| õ | 5 | 9.3% |
| ã | 4 | 7.4% |
| Distinct | 21 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 3737 |
| Missing (%) | 87.5% |
| Memory size | 33.5 KiB |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
|---|---|
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
| C34 - Bronquios e Pulmoes | |
| C71 - Encefalo | |
| Other values (16) |
Length
| Max length | 100 |
|---|---|
| Median length | 59 |
| Mean length | 46.403738 |
| Min length | 10 |
Characters and Unicode
| Total characters | 24826 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | C50 - Mama |
|---|---|
| 2nd row | C71 - Encefalo |
| 3rd row | C48 - Tecidos Moles do Retroperitônio e do Peritônio |
| 4th row | C41 - Ossos e Das Cartilagens Articulares de Outras Localizações |
| 5th row | C50 - Mama |
Common Values
| Value | Count | Frequency (%) |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 125 | 2.9% |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 94 | 2.2% |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 90 | 2.1% |
| C34 - Bronquios e Pulmoes | 68 | 1.6% |
| C71 - Encefalo | 32 | 0.7% |
| C40 - Ossos e Cartilagens Articulares Dos Membros | 32 | 0.7% |
| C38 - Coração, Mediastino e Pleura, | 30 | 0.7% |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 16 | 0.4% |
| C50 - Mama | 14 | 0.3% |
| C49 - Tecido Conjuntivo e de Outros Tecidos Moles | 12 | 0.3% |
| Other values (11) | 22 | 0.5% |
| (Missing) | 3737 |
Length
| Value | Count | Frequency (%) |
| 535 | 13.1% | |
| e | 471 | 11.5% |
| das | 215 | 5.3% |
| ossos | 157 | 3.8% |
| cartilagens | 157 | 3.8% |
| articulares | 157 | 3.8% |
| de | 138 | 3.4% |
| dos | 127 | 3.1% |
| outras | 126 | 3.1% |
| c41 | 125 | 3.1% |
| Other values (75) | 1881 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3554 | ||
| s | 2216 | 8.9% |
| a | 1989 | 8.0% |
| e | 1777 | 7.2% |
| i | 1636 | 6.6% |
| o | 1354 | 5.5% |
| r | 1113 | 4.5% |
| t | 833 | 3.4% |
| c | 826 | 3.3% |
| l | 818 | 3.3% |
| Other values (53) | 8710 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16599 | |
| Space Separator | 3554 | 14.3% |
| Uppercase Letter | 2904 | 11.7% |
| Decimal Number | 1070 | 4.3% |
| Dash Punctuation | 630 | 2.5% |
| Other Punctuation | 63 | 0.3% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 2216 | |
| a | 1989 | |
| e | 1777 | |
| i | 1636 | |
| o | 1354 | |
| r | 1113 | 6.7% |
| t | 833 | 5.0% |
| c | 826 | 5.0% |
| l | 818 | 4.9% |
| n | 759 | 4.6% |
| Other values (21) | 3278 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 737 | |
| D | 342 | |
| O | 296 | |
| L | 219 | 7.5% |
| A | 161 | 5.5% |
| B | 158 | 5.4% |
| E | 128 | 4.4% |
| P | 120 | 4.1% |
| M | 114 | 3.9% |
| G | 102 | 3.5% |
| Other values (7) | 527 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 264 | |
| 7 | 234 | |
| 2 | 183 | |
| 1 | 158 | |
| 3 | 99 | 9.3% |
| 0 | 56 | 5.2% |
| 8 | 47 | 4.4% |
| 5 | 15 | 1.4% |
| 9 | 12 | 1.1% |
| 6 | 2 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 3554 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 630 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 63 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19503 | |
| Common | 5323 | 21.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 2216 | 11.4% |
| a | 1989 | 10.2% |
| e | 1777 | 9.1% |
| i | 1636 | 8.4% |
| o | 1354 | 6.9% |
| r | 1113 | 5.7% |
| t | 833 | 4.3% |
| c | 826 | 4.2% |
| l | 818 | 4.2% |
| n | 759 | 3.9% |
| Other values (38) | 6182 |
Common
| Value | Count | Frequency (%) |
| 3554 | ||
| - | 630 | 11.8% |
| 4 | 264 | 5.0% |
| 7 | 234 | 4.4% |
| 2 | 183 | 3.4% |
| 1 | 158 | 3.0% |
| 3 | 99 | 1.9% |
| , | 63 | 1.2% |
| 0 | 56 | 1.1% |
| 8 | 47 | 0.9% |
| Other values (5) | 35 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23918 | |
| None | 908 | 3.7% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3554 | ||
| s | 2216 | 9.3% |
| a | 1989 | 8.3% |
| e | 1777 | 7.4% |
| i | 1636 | 6.8% |
| o | 1354 | 5.7% |
| r | 1113 | 4.7% |
| t | 833 | 3.5% |
| c | 826 | 3.5% |
| l | 818 | 3.4% |
| Other values (45) | 7802 |
None
| Value | Count | Frequency (%) |
| á | 278 | |
| ç | 155 | |
| õ | 125 | |
| ã | 124 | |
| â | 100 | 11.0% |
| Ã | 90 | 9.9% |
| ô | 33 | 3.6% |
| é | 3 | 0.3% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 52.6% |
| Missing | 4253 |
| Missing (%) | 99.6% |
| Memory size | 33.5 KiB |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | |
|---|---|
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
| C74 - Glândula Supra-renal (Glândula Adrenal) | |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
| Other values (5) |
Length
| Max length | 64 |
|---|---|
| Median length | 52 |
| Mean length | 48.263158 |
| Min length | 14 |
Characters and Unicode
| Total characters | 917 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 26.3% |
Sample
| 1st row | C48 - Tecidos Moles do Retroperitônio e do Peritônio |
|---|---|
| 2nd row | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
| 3rd row | C48 - Tecidos Moles do Retroperitônio e do Peritônio |
| 4th row | C49 - Tecido Conjuntivo e de Outros Tecidos Moles |
| 5th row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
Common Values
| Value | Count | Frequency (%) |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 5 | 0.1% |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 3 | 0.1% |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 2 | < 0.1% |
| C74 - Glândula Supra-renal (Glândula Adrenal) | 2 | < 0.1% |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 2 | < 0.1% |
| C49 - Tecido Conjuntivo e de Outros Tecidos Moles | 1 | < 0.1% |
| C64 - Rim, Exceto Pelve Renal | 1 | < 0.1% |
| C70 - Meninges | 1 | < 0.1% |
| C40 - Ossos e Cartilagens Articulares Dos Membros | 1 | < 0.1% |
| C34 - Bronquios e Pulmoes | 1 | < 0.1% |
| (Missing) | 4253 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 19 | 12.6% | |
| e | 15 | 9.9% |
| do | 10 | 6.6% |
| tecidos | 6 | 4.0% |
| moles | 6 | 4.0% |
| c48 | 5 | 3.3% |
| retroperitônio | 5 | 3.3% |
| peritônio | 5 | 3.3% |
| glândula | 4 | 2.6% |
| das | 4 | 2.6% |
| Other values (38) | 72 |
Most occurring characters
| Value | Count | Frequency (%) |
| 132 | ||
| e | 76 | 8.3% |
| o | 66 | 7.2% |
| i | 65 | 7.1% |
| s | 59 | 6.4% |
| a | 49 | 5.3% |
| r | 42 | 4.6% |
| n | 38 | 4.1% |
| d | 34 | 3.7% |
| l | 34 | 3.7% |
| Other values (51) | 322 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 615 | |
| Space Separator | 132 | 14.4% |
| Uppercase Letter | 104 | 11.3% |
| Decimal Number | 38 | 4.1% |
| Dash Punctuation | 23 | 2.5% |
| Close Punctuation | 2 | 0.2% |
| Open Punctuation | 2 | 0.2% |
| Other Punctuation | 1 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 76 | |
| o | 66 | |
| i | 65 | |
| s | 59 | |
| a | 49 | |
| r | 42 | 6.8% |
| n | 38 | 6.2% |
| d | 34 | 5.5% |
| l | 34 | 5.5% |
| t | 33 | 5.4% |
| Other values (20) | 119 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 23 | |
| D | 8 | 7.7% |
| M | 8 | 7.7% |
| R | 7 | 6.7% |
| P | 7 | 6.7% |
| T | 7 | 6.7% |
| G | 7 | 6.7% |
| O | 6 | 5.8% |
| S | 5 | 4.8% |
| A | 5 | 4.8% |
| Other values (7) | 21 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 13 | |
| 7 | 9 | |
| 8 | 5 | 13.2% |
| 2 | 4 | 10.5% |
| 0 | 2 | 5.3% |
| 1 | 2 | 5.3% |
| 9 | 1 | 2.6% |
| 6 | 1 | 2.6% |
| 3 | 1 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 132 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 23 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 719 | |
| Common | 198 | 21.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 76 | 10.6% |
| o | 66 | 9.2% |
| i | 65 | 9.0% |
| s | 59 | 8.2% |
| a | 49 | 6.8% |
| r | 42 | 5.8% |
| n | 38 | 5.3% |
| d | 34 | 4.7% |
| l | 34 | 4.7% |
| t | 33 | 4.6% |
| Other values (37) | 223 |
Common
| Value | Count | Frequency (%) |
| 132 | ||
| - | 23 | 11.6% |
| 4 | 13 | 6.6% |
| 7 | 9 | 4.5% |
| 8 | 5 | 2.5% |
| 2 | 4 | 2.0% |
| 0 | 2 | 1.0% |
| 1 | 2 | 1.0% |
| ) | 2 | 1.0% |
| ( | 2 | 1.0% |
| Other values (4) | 4 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 883 | |
| None | 34 | 3.7% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 132 | ||
| e | 76 | 8.6% |
| o | 66 | 7.5% |
| i | 65 | 7.4% |
| s | 59 | 6.7% |
| a | 49 | 5.5% |
| r | 42 | 4.8% |
| n | 38 | 4.3% |
| d | 34 | 3.9% |
| l | 34 | 3.9% |
| Other values (44) | 288 |
None
| Value | Count | Frequency (%) |
| ô | 10 | |
| á | 8 | |
| â | 7 | |
| ã | 3 | 8.8% |
| õ | 2 | 5.9% |
| ç | 2 | 5.9% |
| Ã | 2 | 5.9% |
| Distinct | 19 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 4013 |
| Missing (%) | 93.9% |
| Memory size | 33.5 KiB |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
|---|---|
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | |
| C34 - Bronquios e Pulmoes | |
| C38 - Coração, Mediastino e Pleura, | |
| Other values (14) |
Length
| Max length | 100 |
|---|---|
| Median length | 59 |
| Mean length | 45.760618 |
| Min length | 10 |
Characters and Unicode
| Total characters | 11852 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 1.9% |
Sample
| 1st row | C34 - Bronquios e Pulmoes |
|---|---|
| 2nd row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
| 3rd row | C38 - Coração, Mediastino e Pleura, |
| 4th row | C34 - Bronquios e Pulmoes |
| 5th row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
Common Values
| Value | Count | Frequency (%) |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 48 | 1.1% |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 46 | 1.1% |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 44 | 1.0% |
| C34 - Bronquios e Pulmoes | 36 | 0.8% |
| C38 - Coração, Mediastino e Pleura, | 21 | 0.5% |
| C71 - Encefalo | 14 | 0.3% |
| C40 - Ossos e Cartilagens Articulares Dos Membros | 13 | 0.3% |
| C49 - Tecido Conjuntivo e de Outros Tecidos Moles | 9 | 0.2% |
| C74 - Glândula Supra-renal (Glândula Adrenal) | 8 | 0.2% |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 6 | 0.1% |
| Other values (9) | 14 | 0.3% |
| (Missing) | 4013 |
Length
| Value | Count | Frequency (%) |
| 259 | 13.2% | |
| e | 225 | 11.5% |
| das | 94 | 4.8% |
| ossos | 59 | 3.0% |
| cartilagens | 59 | 3.0% |
| articulares | 59 | 3.0% |
| dos | 59 | 3.0% |
| de | 57 | 2.9% |
| intra-hepáticas | 48 | 2.5% |
| biliares | 48 | 2.5% |
| Other values (68) | 990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1698 | 14.3% | |
| s | 977 | 8.2% |
| a | 938 | 7.9% |
| e | 843 | 7.1% |
| i | 769 | 6.5% |
| o | 658 | 5.6% |
| r | 522 | 4.4% |
| l | 410 | 3.5% |
| n | 391 | 3.3% |
| t | 373 | 3.1% |
| Other values (54) | 4273 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7858 | |
| Space Separator | 1698 | 14.3% |
| Uppercase Letter | 1398 | 11.8% |
| Decimal Number | 518 | 4.4% |
| Dash Punctuation | 317 | 2.7% |
| Other Punctuation | 47 | 0.4% |
| Open Punctuation | 8 | 0.1% |
| Close Punctuation | 8 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 977 | |
| a | 938 | |
| e | 843 | |
| i | 769 | |
| o | 658 | |
| r | 522 | 6.6% |
| l | 410 | 5.2% |
| n | 391 | 5.0% |
| t | 373 | 4.7% |
| c | 371 | 4.7% |
| Other values (21) | 1606 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 352 | |
| D | 153 | |
| O | 117 | 8.4% |
| L | 90 | 6.4% |
| B | 84 | 6.0% |
| P | 71 | 5.1% |
| A | 67 | 4.8% |
| E | 64 | 4.6% |
| G | 61 | 4.4% |
| M | 55 | 3.9% |
| Other values (8) | 284 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 125 | |
| 7 | 113 | |
| 2 | 99 | |
| 1 | 61 | |
| 3 | 58 | |
| 8 | 27 | 5.2% |
| 0 | 15 | 2.9% |
| 9 | 9 | 1.7% |
| 6 | 6 | 1.2% |
| 5 | 5 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 1698 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 317 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 47 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9256 | |
| Common | 2596 | 21.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 977 | 10.6% |
| a | 938 | 10.1% |
| e | 843 | 9.1% |
| i | 769 | 8.3% |
| o | 658 | 7.1% |
| r | 522 | 5.6% |
| l | 410 | 4.4% |
| n | 391 | 4.2% |
| t | 373 | 4.0% |
| c | 371 | 4.0% |
| Other values (39) | 3004 |
Common
| Value | Count | Frequency (%) |
| 1698 | ||
| - | 317 | 12.2% |
| 4 | 125 | 4.8% |
| 7 | 113 | 4.4% |
| 2 | 99 | 3.8% |
| 1 | 61 | 2.3% |
| 3 | 58 | 2.2% |
| , | 47 | 1.8% |
| 8 | 27 | 1.0% |
| 0 | 15 | 0.6% |
| Other values (5) | 36 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11416 | |
| None | 436 | 3.7% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1698 | ||
| s | 977 | 8.6% |
| a | 938 | 8.2% |
| e | 843 | 7.4% |
| i | 769 | 6.7% |
| o | 658 | 5.8% |
| r | 522 | 4.6% |
| l | 410 | 3.6% |
| n | 391 | 3.4% |
| t | 373 | 3.3% |
| Other values (46) | 3837 |
None
| Value | Count | Frequency (%) |
| á | 136 | |
| ç | 67 | |
| ã | 65 | |
| â | 61 | |
| Ã | 48 | 11.0% |
| õ | 46 | 10.6% |
| ô | 12 | 2.8% |
| ó | 1 | 0.2% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 80.0% |
| Missing | 4267 |
| Missing (%) | 99.9% |
| Memory size | 33.5 KiB |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
|---|---|
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
Length
| Max length | 64 |
|---|---|
| Median length | 59 |
| Mean length | 57.4 |
| Min length | 48 |
Characters and Unicode
| Total characters | 287 |
|---|---|
| Distinct characters | 48 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 60.0% |
Sample
| 1st row | C22 - FÃgado e Das Vias Biliares Intra-hepáticas |
|---|---|
| 2nd row | C48 - Tecidos Moles do Retroperitônio e do Peritônio |
| 3rd row | C41 - Ossos e Das Cartilagens Articulares de Outras Localizações |
| 4th row | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
| 5th row | C41 - Ossos e Das Cartilagens Articulares de Outras Localizações |
Common Values
| Value | Count | Frequency (%) |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 2 | < 0.1% |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 1 | < 0.1% |
| C48 - Tecidos Moles do Retroperitônio e do Peritônio | 1 | < 0.1% |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 1 | < 0.1% |
| (Missing) | 4267 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 5 | 10.9% | |
| e | 5 | 10.9% |
| das | 3 | 6.5% |
| c41 | 2 | 4.3% |
| de | 2 | 4.3% |
| outras | 2 | 4.3% |
| localizações | 2 | 4.3% |
| articulares | 2 | 4.3% |
| cartilagens | 2 | 4.3% |
| ossos | 2 | 4.3% |
| Other values (18) | 19 |
Most occurring characters
| Value | Count | Frequency (%) |
| 41 | ||
| s | 26 | 9.1% |
| a | 23 | 8.0% |
| e | 22 | 7.7% |
| i | 21 | 7.3% |
| o | 16 | 5.6% |
| r | 14 | 4.9% |
| t | 12 | 4.2% |
| c | 10 | 3.5% |
| l | 9 | 3.1% |
| Other values (38) | 93 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 198 | |
| Space Separator | 41 | 14.3% |
| Uppercase Letter | 32 | 11.1% |
| Decimal Number | 10 | 3.5% |
| Dash Punctuation | 6 | 2.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 26 | |
| a | 23 | |
| e | 22 | |
| i | 21 | |
| o | 16 | |
| r | 14 | 7.1% |
| t | 12 | 6.1% |
| c | 10 | 5.1% |
| l | 9 | 4.5% |
| d | 8 | 4.0% |
| Other values (14) | 37 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 7 | |
| D | 4 | |
| O | 4 | |
| L | 3 | |
| A | 2 | 6.2% |
| F | 1 | 3.1% |
| G | 1 | 3.1% |
| E | 1 | 3.1% |
| N | 1 | 3.1% |
| S | 1 | 3.1% |
| Other values (7) | 7 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 3 | |
| 7 | 2 | |
| 1 | 2 | |
| 2 | 2 | |
| 8 | 1 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| 41 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 230 | |
| Common | 57 | 19.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 26 | 11.3% |
| a | 23 | 10.0% |
| e | 22 | 9.6% |
| i | 21 | 9.1% |
| o | 16 | 7.0% |
| r | 14 | 6.1% |
| t | 12 | 5.2% |
| c | 10 | 4.3% |
| l | 9 | 3.9% |
| d | 8 | 3.5% |
| Other values (31) | 69 |
Common
| Value | Count | Frequency (%) |
| 41 | ||
| - | 6 | 10.5% |
| 4 | 3 | 5.3% |
| 7 | 2 | 3.5% |
| 1 | 2 | 3.5% |
| 2 | 2 | 3.5% |
| 8 | 1 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 275 | |
| None | 12 | 4.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 41 | ||
| s | 26 | 9.5% |
| a | 23 | 8.4% |
| e | 22 | 8.0% |
| i | 21 | 7.6% |
| o | 16 | 5.8% |
| r | 14 | 5.1% |
| t | 12 | 4.4% |
| c | 10 | 3.6% |
| l | 9 | 3.3% |
| Other values (31) | 81 |
None
| Value | Count | Frequency (%) |
| á | 3 | |
| ô | 2 | |
| õ | 2 | |
| ç | 2 | |
| ã | 1 | 8.3% |
| Ã | 1 | 8.3% |
| â | 1 | 8.3% |
| Distinct | 16 |
|---|---|
| Distinct (%) | 14.4% |
| Missing | 4161 |
| Missing (%) | 97.4% |
| Memory size | 33.5 KiB |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | |
|---|---|
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | |
| C71 - Encefalo | |
| C34 - Bronquios e Pulmoes | |
| Other values (11) |
Length
| Max length | 64 |
|---|---|
| Median length | 49 |
| Mean length | 43.099099 |
| Min length | 10 |
Characters and Unicode
| Total characters | 4784 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | C50 - Mama |
|---|---|
| 2nd row | C44 - Pele nao-melanoma |
| 3rd row | C42 - Sistema hematopoiético e reticuloendotelial |
| 4th row | C74 - Glândula Supra-renal (Glândula Adrenal) |
| 5th row | C06 - Outras parte da Boca |
Common Values
| Value | Count | Frequency (%) |
| C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | 23 | 0.5% |
| C22 - FÃgado e Das Vias Biliares Intra-hepáticas | 15 | 0.4% |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 14 | 0.3% |
| C71 - Encefalo | 11 | 0.3% |
| C34 - Bronquios e Pulmoes | 9 | 0.2% |
| C38 - Coração, Mediastino e Pleura, | 7 | 0.2% |
| C74 - Glândula Supra-renal (Glândula Adrenal) | 5 | 0.1% |
| C50 - Mama | 5 | 0.1% |
| C40 - Ossos e Cartilagens Articulares Dos Membros | 5 | 0.1% |
| C49 - Tecido Conjuntivo e de Outros Tecidos Moles | 4 | 0.1% |
| Other values (6) | 13 | 0.3% |
| (Missing) | 4161 |
Length
| Value | Count | Frequency (%) |
| 111 | 14.0% | |
| e | 82 | 10.4% |
| das | 38 | 4.8% |
| ossos | 28 | 3.5% |
| cartilagens | 28 | 3.5% |
| articulares | 28 | 3.5% |
| de | 27 | 3.4% |
| outras | 24 | 3.0% |
| c41 | 23 | 2.9% |
| localizações | 23 | 2.9% |
| Other values (57) | 379 |
Most occurring characters
| Value | Count | Frequency (%) |
| 680 | 14.2% | |
| a | 393 | 8.2% |
| s | 384 | 8.0% |
| e | 353 | 7.4% |
| i | 288 | 6.0% |
| o | 253 | 5.3% |
| r | 211 | 4.4% |
| l | 186 | 3.9% |
| t | 161 | 3.4% |
| n | 160 | 3.3% |
| Other values (53) | 1715 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3166 | |
| Space Separator | 680 | 14.2% |
| Uppercase Letter | 557 | 11.6% |
| Decimal Number | 222 | 4.6% |
| Dash Punctuation | 133 | 2.8% |
| Other Punctuation | 16 | 0.3% |
| Close Punctuation | 5 | 0.1% |
| Open Punctuation | 5 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 393 | |
| s | 384 | |
| e | 353 | |
| i | 288 | |
| o | 253 | |
| r | 211 | 6.7% |
| l | 186 | 5.9% |
| t | 161 | 5.1% |
| n | 160 | 5.1% |
| c | 151 | 4.8% |
| Other values (21) | 626 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 150 | |
| D | 57 | 10.2% |
| O | 56 | 10.1% |
| L | 37 | 6.6% |
| A | 33 | 5.9% |
| M | 27 | 4.8% |
| E | 27 | 4.8% |
| B | 25 | 4.5% |
| G | 24 | 4.3% |
| P | 23 | 4.1% |
| Other values (7) | 98 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 57 | |
| 7 | 47 | |
| 1 | 34 | |
| 2 | 32 | |
| 3 | 16 | 7.2% |
| 0 | 14 | 6.3% |
| 8 | 10 | 4.5% |
| 5 | 5 | 2.3% |
| 9 | 4 | 1.8% |
| 6 | 3 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 680 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 133 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 16 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3723 | |
| Common | 1061 | 22.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 393 | 10.6% |
| s | 384 | 10.3% |
| e | 353 | 9.5% |
| i | 288 | 7.7% |
| o | 253 | 6.8% |
| r | 211 | 5.7% |
| l | 186 | 5.0% |
| t | 161 | 4.3% |
| n | 160 | 4.3% |
| c | 151 | 4.1% |
| Other values (38) | 1183 |
Common
| Value | Count | Frequency (%) |
| 680 | ||
| - | 133 | 12.5% |
| 4 | 57 | 5.4% |
| 7 | 47 | 4.4% |
| 1 | 34 | 3.2% |
| 2 | 32 | 3.0% |
| 3 | 16 | 1.5% |
| , | 16 | 1.5% |
| 0 | 14 | 1.3% |
| 8 | 10 | 0.9% |
| Other values (5) | 22 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4620 | |
| None | 164 | 3.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 680 | ||
| a | 393 | 8.5% |
| s | 384 | 8.3% |
| e | 353 | 7.6% |
| i | 288 | 6.2% |
| o | 253 | 5.5% |
| r | 211 | 4.6% |
| l | 186 | 4.0% |
| t | 161 | 3.5% |
| n | 160 | 3.5% |
| Other values (45) | 1551 |
None
| Value | Count | Frequency (%) |
| á | 43 | |
| ç | 30 | |
| â | 24 | |
| õ | 23 | |
| ã | 21 | |
| Ã | 15 | 9.1% |
| ô | 6 | 3.7% |
| é | 2 | 1.2% |
local_de_recidiva_a_distancia_metastase_4_cid_o_topografia_2
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4271 |
| Missing (%) | > 99.9% |
| Memory size | 33.5 KiB |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
|---|
Length
| Max length | 59 |
|---|---|
| Median length | 59 |
| Mean length | 59 |
| Min length | 59 |
Characters and Unicode
| Total characters | 59 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos |
|---|
Common Values
| Value | Count | Frequency (%) |
| C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | 1 | < 0.1% |
| (Missing) | 4271 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| c77 | 1 | |
| 1 | ||
| secundária | 1 | |
| e | 1 | |
| não | 1 | |
| especificada | 1 | |
| dos | 1 | |
| gânglios | 1 | |
| linfáticos | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 13.6% | |
| i | 6 | 10.2% |
| s | 4 | 6.8% |
| o | 4 | 6.8% |
| c | 4 | 6.8% |
| e | 3 | 5.1% |
| a | 3 | 5.1% |
| n | 3 | 5.1% |
| á | 2 | 3.4% |
| 7 | 2 | 3.4% |
| Other values (18) | 20 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 41 | |
| Space Separator | 8 | 13.6% |
| Uppercase Letter | 7 | 11.9% |
| Decimal Number | 2 | 3.4% |
| Dash Punctuation | 1 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 6 | |
| s | 4 | |
| o | 4 | |
| c | 4 | |
| e | 3 | 7.3% |
| a | 3 | 7.3% |
| n | 3 | 7.3% |
| á | 2 | 4.9% |
| d | 2 | 4.9% |
| f | 2 | 4.9% |
| Other values (8) | 8 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 | |
| L | 1 | |
| D | 1 | |
| C | 1 | |
| N | 1 | |
| E | 1 | |
| S | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 8 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 48 | |
| Common | 11 | 18.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 6 | 12.5% |
| s | 4 | 8.3% |
| o | 4 | 8.3% |
| c | 4 | 8.3% |
| e | 3 | 6.2% |
| a | 3 | 6.2% |
| n | 3 | 6.2% |
| á | 2 | 4.2% |
| d | 2 | 4.2% |
| f | 2 | 4.2% |
| Other values (15) | 15 |
Common
| Value | Count | Frequency (%) |
| 8 | ||
| 7 | 2 | 18.2% |
| - | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55 | |
| None | 4 | 6.8% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | ||
| i | 6 | 10.9% |
| s | 4 | 7.3% |
| o | 4 | 7.3% |
| c | 4 | 7.3% |
| e | 3 | 5.5% |
| a | 3 | 5.5% |
| n | 3 | 5.5% |
| 7 | 2 | 3.6% |
| d | 2 | 3.6% |
| Other values (15) | 16 |
None
| Value | Count | Frequency (%) |
| á | 2 | |
| â | 1 | |
| ã | 1 |
| Distinct | 43 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| CARCINOMA DUCTAL INFILTRANTE SOE | |
|---|---|
| CARCINOMA LOBULAR SOE | 140 |
| ADENOCARCINOMA MUCINOSO | 49 |
| CARCINOMA METAPLASICO SOE | 46 |
| CARCINOMA PAPILAR SOE | 38 |
| Other values (38) | 206 |
Length
| Max length | 65 |
|---|---|
| Median length | 32 |
| Mean length | 31.561564 |
| Min length | 11 |
Characters and Unicode
| Total characters | 134831 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | CARCINOMA DUCTAL INFILTRANTE SOE |
|---|---|
| 2nd row | CARCINOMA DUCTAL INFILTRANTE SOE |
| 3rd row | ADENOCARCINOMA MUCINOSO |
| 4th row | CARCINOMA DUCTAL INFILTRANTE SOE |
| 5th row | CARCINOMA DUCTAL INFILTRANTE SOE |
Common Values
| Value | Count | Frequency (%) |
| CARCINOMA DUCTAL INFILTRANTE SOE | 3793 | |
| CARCINOMA LOBULAR SOE | 140 | 3.3% |
| ADENOCARCINOMA MUCINOSO | 49 | 1.1% |
| CARCINOMA METAPLASICO SOE | 46 | 1.1% |
| CARCINOMA PAPILAR SOE | 38 | 0.9% |
| CARCINOMA INTRADUCTAL NAO INFILTRANTE SOE | 30 | 0.7% |
| ADENOCARCINOMA PAPILAR INTRADUCTAL COM INVASAO | 19 | 0.4% |
| CARCINOMA DE CELULAS ACINOSAS | 19 | 0.4% |
| CARCINOMA DUCTAL INFILTRATIVO MISTO COM OUTROS TIPOS DE CARCINOMA | 18 | 0.4% |
| CARCINOMA DUCTAL INFILTRANTE E LOBULAR | 14 | 0.3% |
| Other values (33) | 106 | 2.5% |
Length
| Value | Count | Frequency (%) |
| carcinoma | 4194 | |
| soe | 4092 | |
| infiltrante | 3857 | |
| ductal | 3833 | |
| lobular | 164 | 1.0% |
| adenocarcinoma | 90 | 0.5% |
| papilar | 65 | 0.4% |
| de | 63 | 0.4% |
| intraductal | 51 | 0.3% |
| mucinoso | 49 | 0.3% |
| Other values (48) | 449 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 17125 | |
| C | 12706 | |
| 12635 | ||
| I | 12438 | |
| N | 12330 | |
| T | 11859 | |
| O | 9113 | |
| R | 8526 | 6.3% |
| L | 8286 | 6.1% |
| E | 8282 | 6.1% |
| Other values (11) | 21531 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 122196 | |
| Space Separator | 12635 | 9.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 17125 | |
| C | 12706 | |
| I | 12438 | |
| N | 12330 | |
| T | 11859 | |
| O | 9113 | |
| R | 8526 | |
| L | 8286 | |
| E | 8282 | |
| M | 4516 | 3.7% |
| Other values (10) | 17015 |
Space Separator
| Value | Count | Frequency (%) |
| 12635 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 122196 | |
| Common | 12635 | 9.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 17125 | |
| C | 12706 | |
| I | 12438 | |
| N | 12330 | |
| T | 11859 | |
| O | 9113 | |
| R | 8526 | |
| L | 8286 | |
| E | 8282 | |
| M | 4516 | 3.7% |
| Other values (10) | 17015 |
Common
| Value | Count | Frequency (%) |
| 12635 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 134831 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 17125 | |
| C | 12706 | |
| 12635 | ||
| I | 12438 | |
| N | 12330 | |
| T | 11859 | |
| O | 9113 | |
| R | 8526 | 6.3% |
| L | 8286 | 6.1% |
| E | 8282 | 6.1% |
| Other values (11) | 21531 |
descricao_da_morfologia_de_acordo_com_cid_o_2
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 73 |
|---|---|
| Distinct (%) | 19.8% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| CARCINOMA DUCTAL INFILTRANTE SOE | |
|---|---|
| CARCINOMA INTRADUCTAL NAO INFILTRANTE SOE | |
| CARCINOMA ESCAMOCELULAR SOE | |
| ADENOCARCINOMA SOE | |
| CARCINOMA BASOCELULAR NODULAR | 13 |
| Other values (68) |
Length
| Max length | 67 |
|---|---|
| Median length | 58 |
| Mean length | 30.802168 |
| Min length | 12 |
Characters and Unicode
| Total characters | 11366 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 40 ? |
|---|---|
| Unique (%) | 10.8% |
Sample
| 1st row | ADENOCARCINOMA SOE |
|---|---|
| 2nd row | ADENOCARCINOMA SOE |
| 3rd row | CARCINOMA DUCTAL INFILTRANTE SOE |
| 4th row | MELANOMA DE PROPAGACAO SUPERFICIAL |
| 5th row | MELANOMA MALIGNO SOE |
Common Values
| Value | Count | Frequency (%) |
| CARCINOMA DUCTAL INFILTRANTE SOE | 111 | 2.6% |
| CARCINOMA INTRADUCTAL NAO INFILTRANTE SOE | 43 | 1.0% |
| CARCINOMA ESCAMOCELULAR SOE | 25 | 0.6% |
| ADENOCARCINOMA SOE | 23 | 0.5% |
| CARCINOMA BASOCELULAR NODULAR | 13 | 0.3% |
| CARCINOMA LOBULAR SOE | 11 | 0.3% |
| CARCINOMA DE CELULAS RENAIS SOE | 11 | 0.3% |
| ADENOCARCINOMA TUBULAR | 10 | 0.2% |
| ADENOCARCINOMA PAPILAR SOE | 9 | 0.2% |
| CARCINOMA DE CELULAS ACINOSAS | 8 | 0.2% |
| Other values (63) | 105 | 2.5% |
| (Missing) | 3903 |
Length
| Value | Count | Frequency (%) |
| soe | 290 | |
| carcinoma | 265 | |
| infiltrante | 156 | |
| ductal | 113 | 8.1% |
| adenocarcinoma | 65 | 4.6% |
| de | 48 | 3.4% |
| intraductal | 47 | 3.4% |
| nao | 47 | 3.4% |
| celulas | 34 | 2.4% |
| escamocelular | 30 | 2.1% |
| Other values (115) | 306 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1564 | |
| C | 1034 | |
| 1032 | ||
| O | 1015 | |
| N | 958 | |
| I | 914 | |
| E | 820 | |
| R | 748 | 6.6% |
| L | 634 | 5.6% |
| T | 610 | 5.4% |
| Other values (28) | 2037 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10310 | |
| Space Separator | 1032 | 9.1% |
| Decimal Number | 10 | 0.1% |
| Dash Punctuation | 4 | < 0.1% |
| Other Punctuation | 4 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1564 | |
| C | 1034 | |
| O | 1015 | |
| N | 958 | |
| I | 914 | |
| E | 820 | |
| R | 748 | |
| L | 634 | |
| T | 610 | 5.9% |
| S | 499 | 4.8% |
| Other values (14) | 1514 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 2 | |
| 8 | 1 | |
| 4 | 1 | |
| 1 | 1 | |
| 9 | 1 | |
| 5 | 1 | |
| 3 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2 | |
| \ | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 1032 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10310 | |
| Common | 1056 | 9.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1564 | |
| C | 1034 | |
| O | 1015 | |
| N | 958 | |
| I | 914 | |
| E | 820 | |
| R | 748 | |
| L | 634 | |
| T | 610 | 5.9% |
| S | 499 | 4.8% |
| Other values (14) | 1514 |
Common
| Value | Count | Frequency (%) |
| 1032 | ||
| - | 4 | 0.4% |
| ( | 3 | 0.3% |
| ) | 3 | 0.3% |
| 2 | 2 | 0.2% |
| 0 | 2 | 0.2% |
| / | 2 | 0.2% |
| \ | 2 | 0.2% |
| 8 | 1 | 0.1% |
| 4 | 1 | 0.1% |
| Other values (4) | 4 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11366 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1564 | |
| C | 1034 | |
| 1032 | ||
| O | 1015 | |
| N | 958 | |
| I | 914 | |
| E | 820 | |
| R | 748 | 6.6% |
| L | 634 | 5.6% |
| T | 610 | 5.4% |
| Other values (28) | 2037 |
descricao_da_topografia_1
Categorical
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| MAMA SOE (EXCLUI PELE DA MAMA C44.5) | |
|---|---|
| MAMA QUADRANTE SUPERIOR EXTERNO DA | |
| MAMA QUADRANTE SUPERIOR INTERNO DA | |
| MAMA QUADRANTE INFERIOR EXTERNO DA | |
| MAMA LESAO SOBREPOSTA DA | 175 |
| Other values (8) |
Length
| Max length | 36 |
|---|---|
| Median length | 34 |
| Mean length | 33.171348 |
| Min length | 4 |
Characters and Unicode
| Total characters | 141708 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | MAMA QUADRANTE SUPERIOR EXTERNO DA |
|---|---|
| 2nd row | MAMA LESAO SOBREPOSTA DA |
| 3rd row | MAMA SOE (EXCLUI PELE DA MAMA C44.5) |
| 4th row | MAMA QUADRANTE INFERIOR EXTERNO DA |
| 5th row | MAMA LESAO SOBREPOSTA DA |
Common Values
| Value | Count | Frequency (%) |
| MAMA SOE (EXCLUI PELE DA MAMA C44.5) | 1927 | |
| MAMA QUADRANTE SUPERIOR EXTERNO DA | 1155 | |
| MAMA QUADRANTE SUPERIOR INTERNO DA | 282 | 6.6% |
| MAMA QUADRANTE INFERIOR EXTERNO DA | 247 | 5.8% |
| MAMA LESAO SOBREPOSTA DA | 175 | 4.1% |
| MAMA QUADRANTE INFERIOR INTERNO DA | 174 | 4.1% |
| MAMA MAMILO | 168 | 3.9% |
| MAMA PORCAO CENTRAL DA | 131 | 3.1% |
| MAMA PORCAO AXILAR DA | 9 | 0.2% |
| ASSOALHO DA BOCA SOE | 1 | < 0.1% |
| Other values (3) | 3 | 0.1% |
Length
| Value | Count | Frequency (%) |
| mama | 6195 | |
| da | 4101 | |
| soe | 1929 | 7.9% |
| exclui | 1927 | 7.9% |
| pele | 1927 | 7.9% |
| c44.5 | 1927 | 7.9% |
| quadrante | 1858 | 7.6% |
| superior | 1437 | 5.9% |
| externo | 1402 | 5.7% |
| interno | 456 | 1.9% |
| Other values (14) | 1226 | 5.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 21017 | |
| 20113 | ||
| E | 15170 | |
| M | 12726 | 9.0% |
| R | 7889 | 5.6% |
| O | 6627 | 4.7% |
| D | 5960 | 4.2% |
| U | 5223 | 3.7% |
| I | 4839 | 3.4% |
| N | 4724 | 3.3% |
| Other values (15) | 37420 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 110033 | |
| Space Separator | 20113 | 14.2% |
| Decimal Number | 5781 | 4.1% |
| Close Punctuation | 1927 | 1.4% |
| Other Punctuation | 1927 | 1.4% |
| Open Punctuation | 1927 | 1.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 21017 | |
| E | 15170 | |
| M | 12726 | |
| R | 7889 | 7.2% |
| O | 6627 | 6.0% |
| D | 5960 | 5.4% |
| U | 5223 | 4.7% |
| I | 4839 | 4.4% |
| N | 4724 | 4.3% |
| L | 4339 | 3.9% |
| Other values (9) | 21519 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 3854 | |
| 5 | 1927 |
Space Separator
| Value | Count | Frequency (%) |
| 20113 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1927 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1927 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1927 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 110033 | |
| Common | 31675 | 22.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 21017 | |
| E | 15170 | |
| M | 12726 | |
| R | 7889 | 7.2% |
| O | 6627 | 6.0% |
| D | 5960 | 5.4% |
| U | 5223 | 4.7% |
| I | 4839 | 4.4% |
| N | 4724 | 4.3% |
| L | 4339 | 3.9% |
| Other values (9) | 21519 |
Common
| Value | Count | Frequency (%) |
| 20113 | ||
| 4 | 3854 | 12.2% |
| 5 | 1927 | 6.1% |
| ) | 1927 | 6.1% |
| . | 1927 | 6.1% |
| ( | 1927 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 141708 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 21017 | |
| 20113 | ||
| E | 15170 | |
| M | 12726 | 9.0% |
| R | 7889 | 5.6% |
| O | 6627 | 4.7% |
| D | 5960 | 4.2% |
| U | 5223 | 3.7% |
| I | 4839 | 3.4% |
| N | 4724 | 3.3% |
| Other values (15) | 37420 |
descricao_da_topografia_2
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 72 |
|---|---|
| Distinct (%) | 19.5% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| MAMA SOE (EXCLUI PELE DA MAMA C44.5) | |
|---|---|
| MAMA QUADRANTE SUPERIOR EXTERNO DA | |
| GLANDULA TIREOIDE | 17 |
| MAMA QUADRANTE SUPERIOR INTERNO DA | 15 |
| PELE DO OMBRO E MEMBROS SUPERIORES | 12 |
| Other values (67) |
Length
| Max length | 77 |
|---|---|
| Median length | 59 |
| Mean length | 26.96748 |
| Min length | 4 |
Characters and Unicode
| Total characters | 9951 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 34 ? |
|---|---|
| Unique (%) | 9.2% |
Sample
| 1st row | COLO DO UTERO |
|---|---|
| 2nd row | COLON DESCENDENTE |
| 3rd row | MAMA SOE (EXCLUI PELE DA MAMA C44.5) |
| 4th row | PELE DO TRONCO |
| 5th row | PELE DO QUADRIL E MEMBROS INFERIORES |
Common Values
| Value | Count | Frequency (%) |
| MAMA SOE (EXCLUI PELE DA MAMA C44.5) | 87 | 2.0% |
| MAMA QUADRANTE SUPERIOR EXTERNO DA | 42 | 1.0% |
| GLANDULA TIREOIDE | 17 | 0.4% |
| MAMA QUADRANTE SUPERIOR INTERNO DA | 15 | 0.4% |
| PELE DO OMBRO E MEMBROS SUPERIORES | 12 | 0.3% |
| RIM SOE | 12 | 0.3% |
| MAMA QUADRANTE INFERIOR INTERNO DA | 12 | 0.3% |
| MAMA LESAO SOBREPOSTA DA | 11 | 0.3% |
| MAMA QUADRANTE INFERIOR EXTERNO DA | 10 | 0.2% |
| PELE DE OUTRAS PARTES E DE PARTES NAO ESPECIFICADAS DA FACE | 9 | 0.2% |
| Other values (62) | 142 | 3.3% |
| (Missing) | 3903 |
Length
| Value | Count | Frequency (%) |
| mama | 274 | |
| da | 197 | 11.7% |
| pele | 125 | 7.4% |
| soe | 120 | 7.1% |
| exclui | 87 | 5.2% |
| c44.5 | 87 | 5.2% |
| quadrante | 79 | 4.7% |
| superior | 65 | 3.9% |
| do | 60 | 3.6% |
| externo | 54 | 3.2% |
| Other values (122) | 539 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1319 | ||
| A | 1218 | |
| E | 1125 | |
| O | 760 | 7.6% |
| M | 673 | 6.8% |
| R | 618 | 6.2% |
| D | 458 | 4.6% |
| I | 437 | 4.4% |
| S | 397 | 4.0% |
| L | 358 | 3.6% |
| Other values (20) | 2588 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8105 | |
| Space Separator | 1319 | 13.3% |
| Decimal Number | 261 | 2.6% |
| Other Punctuation | 89 | 0.9% |
| Close Punctuation | 88 | 0.9% |
| Open Punctuation | 88 | 0.9% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1218 | |
| E | 1125 | |
| O | 760 | |
| M | 673 | 8.3% |
| R | 618 | 7.6% |
| D | 458 | 5.7% |
| I | 437 | 5.4% |
| S | 397 | 4.9% |
| L | 358 | 4.4% |
| U | 338 | 4.2% |
| Other values (13) | 1723 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 174 | |
| 5 | 87 |
Space Separator
| Value | Count | Frequency (%) |
| 1319 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 89 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 88 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 88 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8105 | |
| Common | 1846 | 18.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1218 | |
| E | 1125 | |
| O | 760 | |
| M | 673 | 8.3% |
| R | 618 | 7.6% |
| D | 458 | 5.7% |
| I | 437 | 5.4% |
| S | 397 | 4.9% |
| L | 358 | 4.4% |
| U | 338 | 4.2% |
| Other values (13) | 1723 |
Common
| Value | Count | Frequency (%) |
| 1319 | ||
| 4 | 174 | 9.4% |
| . | 89 | 4.8% |
| ) | 88 | 4.8% |
| ( | 88 | 4.8% |
| 5 | 87 | 4.7% |
| - | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9951 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1319 | ||
| A | 1218 | |
| E | 1125 | |
| O | 760 | 7.6% |
| M | 673 | 6.8% |
| R | 618 | 6.2% |
| D | 458 | 4.6% |
| I | 437 | 4.4% |
| S | 397 | 4.0% |
| L | 358 | 3.6% |
| Other values (20) | 2588 |
classificacao_tnm_patologico_n_1
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 4086 |
| Missing (%) | 95.6% |
| Memory size | 33.5 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | |
| 2A | 9 |
| 3 | 8 |
| Other values (5) | 10 |
Length
| Max length | 31 |
|---|---|
| Median length | 1 |
| Mean length | 1.4301075 |
| Min length | 1 |
Characters and Unicode
| Total characters | 266 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | X - nao foi possivel determinar |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 101 | 2.4% |
| 1 | 44 | 1.0% |
| 2 | 14 | 0.3% |
| 2A | 9 | 0.2% |
| 3 | 8 | 0.2% |
| 3A | 4 | 0.1% |
| X - nao foi possivel determinar | 2 | < 0.1% |
| 3B | 2 | < 0.1% |
| 3C | 1 | < 0.1% |
| Y: Na | 1 | < 0.1% |
| (Missing) | 4086 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 101 | |
| 1 | 44 | |
| 2 | 14 | 7.1% |
| 2a | 9 | 4.6% |
| 3 | 8 | 4.1% |
| 3a | 4 | 2.0% |
| x | 2 | 1.0% |
| 2 | 1.0% | |
| nao | 2 | 1.0% |
| foi | 2 | 1.0% |
| Other values (6) | 9 | 4.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 101 | |
| 1 | 44 | |
| 2 | 23 | 8.6% |
| 3 | 15 | 5.6% |
| A | 13 | 4.9% |
| 11 | 4.1% | |
| o | 6 | 2.3% |
| i | 6 | 2.3% |
| e | 6 | 2.3% |
| a | 5 | 1.9% |
| Other values (17) | 36 | 13.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 183 | |
| Lowercase Letter | 49 | 18.4% |
| Uppercase Letter | 20 | 7.5% |
| Space Separator | 11 | 4.1% |
| Dash Punctuation | 2 | 0.8% |
| Other Punctuation | 1 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 6 | |
| i | 6 | |
| e | 6 | |
| a | 5 | |
| r | 4 | |
| s | 4 | |
| n | 4 | |
| d | 2 | 4.1% |
| m | 2 | 4.1% |
| t | 2 | 4.1% |
| Other values (4) | 8 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 13 | |
| B | 2 | 10.0% |
| X | 2 | 10.0% |
| C | 1 | 5.0% |
| Y | 1 | 5.0% |
| N | 1 | 5.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 101 | |
| 1 | 44 | |
| 2 | 23 | 12.6% |
| 3 | 15 | 8.2% |
Space Separator
| Value | Count | Frequency (%) |
| 11 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 197 | |
| Latin | 69 | 25.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 13 | |
| o | 6 | 8.7% |
| i | 6 | 8.7% |
| e | 6 | 8.7% |
| a | 5 | 7.2% |
| r | 4 | 5.8% |
| s | 4 | 5.8% |
| n | 4 | 5.8% |
| d | 2 | 2.9% |
| m | 2 | 2.9% |
| Other values (10) | 17 |
Common
| Value | Count | Frequency (%) |
| 0 | 101 | |
| 1 | 44 | |
| 2 | 23 | 11.7% |
| 3 | 15 | 7.6% |
| 11 | 5.6% | |
| - | 2 | 1.0% |
| : | 1 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 266 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 101 | |
| 1 | 44 | |
| 2 | 23 | 8.6% |
| 3 | 15 | 5.6% |
| A | 13 | 4.9% |
| 11 | 4.1% | |
| o | 6 | 2.3% |
| i | 6 | 2.3% |
| e | 6 | 2.3% |
| a | 5 | 1.9% |
| Other values (17) | 36 | 13.5% |
classificacao_tnm_patologico_n_2
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 40.0% |
| Missing | 4267 |
| Missing (%) | 99.9% |
| Memory size | 33.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 20.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 4 | 0.1% |
| 1 | 1 | < 0.1% |
| (Missing) | 4267 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 1 | 20.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 1 | 20.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 1 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 1 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 1 | 20.0% |
classificacao_tnm_patologico_t_1
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 4085 |
| Missing (%) | 95.6% |
| Memory size | 33.5 KiB |
| 2 | |
|---|---|
| 1C | |
| 3 | |
| 1 | |
| 1B | |
| Other values (9) |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.3957219 |
| Min length | 1 |
Characters and Unicode
| Total characters | 261 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 2.7% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1A |
| 3rd row | 1C |
| 4th row | 2 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 81 | 1.9% |
| 1C | 33 | 0.8% |
| 3 | 21 | 0.5% |
| 1 | 16 | 0.4% |
| 1B | 11 | 0.3% |
| 1A | 7 | 0.2% |
| 4B | 7 | 0.2% |
| IS | 3 | 0.1% |
| IV | 3 | 0.1% |
| 4D | 1 | < 0.1% |
| Other values (4) | 4 | 0.1% |
| (Missing) | 4085 |
Length
| Value | Count | Frequency (%) |
| 2 | 81 | |
| 1c | 33 | |
| 3 | 21 | 11.2% |
| 1 | 16 | 8.5% |
| 1b | 11 | 5.9% |
| 1a | 7 | 3.7% |
| 4b | 7 | 3.7% |
| is | 3 | 1.6% |
| iv | 3 | 1.6% |
| 4d | 1 | 0.5% |
| Other values (5) | 5 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 82 | |
| 1 | 68 | |
| C | 36 | |
| 3 | 21 | 8.0% |
| B | 18 | 6.9% |
| 4 | 9 | 3.4% |
| A | 7 | 2.7% |
| I | 7 | 2.7% |
| V | 3 | 1.1% |
| S | 3 | 1.1% |
| Other values (7) | 7 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 180 | |
| Uppercase Letter | 78 | |
| Other Punctuation | 1 | 0.4% |
| Space Separator | 1 | 0.4% |
| Lowercase Letter | 1 | 0.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 36 | |
| B | 18 | |
| A | 7 | 9.0% |
| I | 7 | 9.0% |
| V | 3 | 3.8% |
| S | 3 | 3.8% |
| D | 1 | 1.3% |
| M | 1 | 1.3% |
| Y | 1 | 1.3% |
| N | 1 | 1.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 82 | |
| 1 | 68 | |
| 3 | 21 | 11.7% |
| 4 | 9 | 5.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 182 | |
| Latin | 79 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 36 | |
| B | 18 | |
| A | 7 | 8.9% |
| I | 7 | 8.9% |
| V | 3 | 3.8% |
| S | 3 | 3.8% |
| D | 1 | 1.3% |
| M | 1 | 1.3% |
| Y | 1 | 1.3% |
| N | 1 | 1.3% |
Common
| Value | Count | Frequency (%) |
| 2 | 82 | |
| 1 | 68 | |
| 3 | 21 | 11.5% |
| 4 | 9 | 4.9% |
| : | 1 | 0.5% |
| 1 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 261 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 82 | |
| 1 | 68 | |
| C | 36 | |
| 3 | 21 | 8.0% |
| B | 18 | 6.9% |
| 4 | 9 | 3.4% |
| A | 7 | 2.7% |
| I | 7 | 2.7% |
| V | 3 | 1.1% |
| S | 3 | 1.1% |
| Other values (7) | 7 | 2.7% |
classificacao_tnm_patologico_t_2
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 60.0% |
| Missing | 4267 |
| Missing (%) | 99.9% |
| Memory size | 33.5 KiB |
| 1B | |
|---|---|
| 1 | |
| IV |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.6 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 20.0% |
Sample
| 1st row | 1B |
|---|---|
| 2nd row | 1 |
| 3rd row | 1B |
| 4th row | IV |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1B | 2 | < 0.1% |
| 1 | 2 | < 0.1% |
| IV | 1 | < 0.1% |
| (Missing) | 4267 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1b | 2 | |
| 1 | 2 | |
| iv | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4 | |
| B | 2 | |
| I | 1 | 12.5% |
| V | 1 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 | |
| Uppercase Letter | 4 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 2 | |
| I | 1 | |
| V | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 | |
| Latin | 4 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 2 | |
| I | 1 | |
| V | 1 |
Common
| Value | Count | Frequency (%) |
| 1 | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 4 | |
| B | 2 | |
| I | 1 | 12.5% |
| V | 1 | 12.5% |
com_recidiva_a_distancia_1
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Não | |
|---|---|
| Sim |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 12816 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Não |
|---|---|
| 2nd row | Não |
| 3rd row | Não |
| 4th row | Não |
| 5th row | Não |
Common Values
| Value | Count | Frequency (%) |
| Não | 3503 | |
| Sim | 769 | 18.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| não | 3503 | |
| sim | 769 | 18.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 3503 | |
| ã | 3503 | |
| o | 3503 | |
| S | 769 | 6.0% |
| i | 769 | 6.0% |
| m | 769 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8544 | |
| Uppercase Letter | 4272 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| ã | 3503 | |
| o | 3503 | |
| i | 769 | 9.0% |
| m | 769 | 9.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3503 | |
| S | 769 | 18.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12816 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 3503 | |
| ã | 3503 | |
| o | 3503 | |
| S | 769 | 6.0% |
| i | 769 | 6.0% |
| m | 769 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9313 | |
| None | 3503 | 27.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 3503 | |
| o | 3503 | |
| S | 769 | 8.3% |
| i | 769 | 8.3% |
| m | 769 | 8.3% |
None
| Value | Count | Frequency (%) |
| ã | 3503 |
com_recidiva_a_distancia_2
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| Não | |
|---|---|
| Sim |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1107 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Não |
|---|---|
| 2nd row | Sim |
| 3rd row | Não |
| 4th row | Não |
| 5th row | Não |
Common Values
| Value | Count | Frequency (%) |
| Não | 329 | 7.7% |
| Sim | 40 | 0.9% |
| (Missing) | 3903 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| não | 329 | |
| sim | 40 | 10.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 329 | |
| ã | 329 | |
| o | 329 | |
| S | 40 | 3.6% |
| i | 40 | 3.6% |
| m | 40 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 738 | |
| Uppercase Letter | 369 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| ã | 329 | |
| o | 329 | |
| i | 40 | 5.4% |
| m | 40 | 5.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 329 | |
| S | 40 | 10.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1107 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 329 | |
| ã | 329 | |
| o | 329 | |
| S | 40 | 3.6% |
| i | 40 | 3.6% |
| m | 40 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 778 | |
| None | 329 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 329 | |
| o | 329 | |
| S | 40 | 5.1% |
| i | 40 | 5.1% |
| m | 40 | 5.1% |
None
| Value | Count | Frequency (%) |
| ã | 329 |
com_recidiva_regional_1
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Não | |
|---|---|
| Sim | 270 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 12816 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Não |
|---|---|
| 2nd row | Sim |
| 3rd row | Não |
| 4th row | Sim |
| 5th row | Não |
Common Values
| Value | Count | Frequency (%) |
| Não | 4002 | |
| Sim | 270 | 6.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| não | 4002 | |
| sim | 270 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 4002 | |
| ã | 4002 | |
| o | 4002 | |
| S | 270 | 2.1% |
| i | 270 | 2.1% |
| m | 270 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8544 | |
| Uppercase Letter | 4272 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| ã | 4002 | |
| o | 4002 | |
| i | 270 | 3.2% |
| m | 270 | 3.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 4002 | |
| S | 270 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12816 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 4002 | |
| ã | 4002 | |
| o | 4002 | |
| S | 270 | 2.1% |
| i | 270 | 2.1% |
| m | 270 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8814 | |
| None | 4002 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 4002 | |
| o | 4002 | |
| S | 270 | 3.1% |
| i | 270 | 3.1% |
| m | 270 | 3.1% |
None
| Value | Count | Frequency (%) |
| ã | 4002 |
com_recidiva_regional_2
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| Não | |
|---|---|
| Sim | 11 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1107 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Não |
|---|---|
| 2nd row | Não |
| 3rd row | Não |
| 4th row | Não |
| 5th row | Sim |
Common Values
| Value | Count | Frequency (%) |
| Não | 358 | 8.4% |
| Sim | 11 | 0.3% |
| (Missing) | 3903 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| não | 358 | |
| sim | 11 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 358 | |
| ã | 358 | |
| o | 358 | |
| S | 11 | 1.0% |
| i | 11 | 1.0% |
| m | 11 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 738 | |
| Uppercase Letter | 369 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| ã | 358 | |
| o | 358 | |
| i | 11 | 1.5% |
| m | 11 | 1.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 358 | |
| S | 11 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1107 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 358 | |
| ã | 358 | |
| o | 358 | |
| S | 11 | 1.0% |
| i | 11 | 1.0% |
| m | 11 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 749 | |
| None | 358 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 358 | |
| o | 358 | |
| S | 11 | 1.5% |
| i | 11 | 1.5% |
| m | 11 | 1.5% |
None
| Value | Count | Frequency (%) |
| ã | 358 |
com_recidiva_local_1
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Não | |
|---|---|
| Sim | 331 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 12816 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Não |
|---|---|
| 2nd row | Sim |
| 3rd row | Não |
| 4th row | Não |
| 5th row | Não |
Common Values
| Value | Count | Frequency (%) |
| Não | 3941 | |
| Sim | 331 | 7.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| não | 3941 | |
| sim | 331 | 7.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 3941 | |
| ã | 3941 | |
| o | 3941 | |
| S | 331 | 2.6% |
| i | 331 | 2.6% |
| m | 331 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8544 | |
| Uppercase Letter | 4272 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| ã | 3941 | |
| o | 3941 | |
| i | 331 | 3.9% |
| m | 331 | 3.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3941 | |
| S | 331 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12816 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 3941 | |
| ã | 3941 | |
| o | 3941 | |
| S | 331 | 2.6% |
| i | 331 | 2.6% |
| m | 331 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8875 | |
| None | 3941 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 3941 | |
| o | 3941 | |
| S | 331 | 3.7% |
| i | 331 | 3.7% |
| m | 331 | 3.7% |
None
| Value | Count | Frequency (%) |
| ã | 3941 |
com_recidiva_local_2
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 3903 |
| Missing (%) | 91.4% |
| Memory size | 33.5 KiB |
| Não | |
|---|---|
| Sim | 25 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1107 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Não |
|---|---|
| 2nd row | Não |
| 3rd row | Não |
| 4th row | Não |
| 5th row | Não |
Common Values
| Value | Count | Frequency (%) |
| Não | 344 | 8.1% |
| Sim | 25 | 0.6% |
| (Missing) | 3903 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| não | 344 | |
| sim | 25 | 6.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 344 | |
| ã | 344 | |
| o | 344 | |
| S | 25 | 2.3% |
| i | 25 | 2.3% |
| m | 25 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 738 | |
| Uppercase Letter | 369 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| ã | 344 | |
| o | 344 | |
| i | 25 | 3.4% |
| m | 25 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 344 | |
| S | 25 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1107 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 344 | |
| ã | 344 | |
| o | 344 | |
| S | 25 | 2.3% |
| i | 25 | 2.3% |
| m | 25 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 763 | |
| None | 344 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 344 | |
| o | 344 | |
| S | 25 | 3.3% |
| i | 25 | 3.3% |
| m | 25 | 3.3% |
None
| Value | Count | Frequency (%) |
| ã | 344 |
| record_id | repeat_instrument_1 | repeat_instrument_2 | repeat_instance_1 | repeat_instance_2 | data_da_primeira_consulta_institucional_dt_pci_1 | data_da_primeira_consulta_institucional_dt_pci_2 | data_do_diagnostico_1 | data_do_diagnostico_2 | codigo_da_topografia_cid_o_1 | codigo_da_topografia_cid_o_2 | codigo_da_morfologia_de_acordo_com_o_cid_o_1 | codigo_da_morfologia_de_acordo_com_o_cid_o_2 | estadio_clinico_1 | estadio_clinico_2 | grupo_de_estadio_clinico_1 | grupo_de_estadio_clinico_2 | classificacao_tnm_clinico_t_1 | classificacao_tnm_clinico_t_2 | classificacao_tnm_clinico_n_1 | classificacao_tnm_clinico_n_2 | classificacao_tnm_clinico_m_1 | classificacao_tnm_clinico_m_2 | metastase_ao_diagnostico_cid_o_1_1 | metastase_ao_diagnostico_cid_o_1_2 | metastase_ao_diagnostico_cid_o_2_1 | metastase_ao_diagnostico_cid_o_2_2 | metastase_ao_diagnostico_cid_o_3_1 | metastase_ao_diagnostico_cid_o_3_2 | metastase_ao_diagnostico_cid_o_4_1 | metastase_ao_diagnostico_cid_o_4_2 | data_do_tratamento_1 | data_do_tratamento_2 | combinacao_dos_tratamentos_realizados_no_hospital_1 | combinacao_dos_tratamentos_realizados_no_hospital_2 | ano_do_diagnostico_1 | ano_do_diagnostico_2 | lateralidade_do_tumor_1 | lateralidade_do_tumor_2 | data_de_recidiva_1 | data_de_recidiva_2 | tempo_desde_o_diagnostico_ate_a_primeira_recidiv_1 | tempo_desde_o_diagnostico_ate_a_primeira_recidiv_2 | local_de_recidiva_a_distancia_metastase_1_cid_o_topografia_1 | local_de_recidiva_a_distancia_metastase_1_cid_o_topografia_2 | local_de_recidiva_a_distancia_metastase_2_cid_o_topografia_1 | local_de_recidiva_a_distancia_metastase_2_cid_o_topografia_2 | local_de_recidiva_a_distancia_metastase_3_cid_o_topografia_1 | local_de_recidiva_a_distancia_metastase_3_cid_o_topografia_2 | local_de_recidiva_a_distancia_metastase_4_cid_o_topografia_1 | local_de_recidiva_a_distancia_metastase_4_cid_o_topografia_2 | descricao_da_morfologia_de_acordo_com_cid_o_1 | descricao_da_morfologia_de_acordo_com_cid_o_2 | descricao_da_topografia_1 | descricao_da_topografia_2 | classificacao_tnm_patologico_n_1 | classificacao_tnm_patologico_n_2 | classificacao_tnm_patologico_t_1 | classificacao_tnm_patologico_t_2 | com_recidiva_a_distancia_1 | com_recidiva_a_distancia_2 | com_recidiva_regional_1 | com_recidiva_regional_2 | com_recidiva_local_1 | com_recidiva_local_2 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 302 | Registro De Tumores | NaN | 1.0 | NaN | 2008-03-22 | NaN | 2008-03-23 | NaN | C504 | NaN | 85003.0 | NaN | IIA | NaN | II | NaN | 2 | NaN | 0 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2008-08-15 | NaN | Cirurgia + Radio + Quimio + Hormonio | NaN | 2008.0 | NaN | Esquerda | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA QUADRANTE SUPERIOR EXTERNO DA | NaN | NaN | NaN | NaN | NaN | Não | NaN | Não | NaN | Não | NaN |
| 1 | 710 | Registro De Tumores | NaN | 1.0 | NaN | 2006-11-11 | NaN | 2007-11-11 | NaN | C508 | NaN | 85003.0 | NaN | IIIA | NaN | III | NaN | 3 | NaN | 1 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2008-05-29 | NaN | Cirurgia + Quimioterapia | NaN | 2008.0 | NaN | Esquerda | NaN | 2014-07-19 | NaN | 2442.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA LESAO SOBREPOSTA DA | NaN | NaN | NaN | NaN | NaN | Não | NaN | Sim | NaN | Sim | NaN |
| 2 | 752 | Registro De Tumores | NaN | 1.0 | NaN | 2007-09-25 | NaN | 2007-12-18 | NaN | C509 | NaN | 84803.0 | NaN | IIA | NaN | II | NaN | 2 | NaN | 0 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2008-04-07 | NaN | Outras combinações | NaN | 2008.0 | NaN | Esquerda | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | ADENOCARCINOMA MUCINOSO | NaN | MAMA SOE (EXCLUI PELE DA MAMA C44.5) | NaN | X - nao foi possivel determinar | NaN | 2 | NaN | Não | NaN | Não | NaN | Não | NaN |
| 3 | 1367 | Registro De Tumores | NaN | 1.0 | NaN | 2008-02-03 | NaN | 2008-02-06 | NaN | C505 | NaN | 85003.0 | NaN | IIA | NaN | II | NaN | 1 | NaN | 1 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2008-09-29 | NaN | Outras combinações | NaN | 2008.0 | NaN | Esquerda | NaN | 2010-07-15 | NaN | 890.0 | NaN | C34 - Bronquios e Pulmoes | NaN | C50 - Mama | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA QUADRANTE INFERIOR EXTERNO DA | NaN | 1 | NaN | 1A | NaN | Não | NaN | Sim | NaN | Não | NaN |
| 4 | 1589 | Registro De Tumores | NaN | 1.0 | NaN | 2008-05-15 | NaN | 2008-05-21 | NaN | C508 | NaN | 85003.0 | NaN | IIB | NaN | II | NaN | 2 | NaN | 1 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2008-09-16 | NaN | Cirurgia + Radio + Quimio | NaN | 2008.0 | NaN | Direita | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA LESAO SOBREPOSTA DA | NaN | NaN | NaN | NaN | NaN | Não | NaN | Não | NaN | Não | NaN |
| 5 | 1705 | Registro De Tumores | NaN | 1.0 | NaN | 2007-05-09 | NaN | 2007-05-10 | NaN | C504 | NaN | 85003.0 | NaN | IIA | NaN | II | NaN | 1 | NaN | 1 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2007-12-06 | NaN | Cirurgia + Radioterapia | NaN | 2008.0 | NaN | Direita | NaN | 2012-12-19 | NaN | 2050.0 | NaN | C38 - Coração, Mediastino e Pleura, | NaN | C71 - Encefalo | NaN | C34 - Bronquios e Pulmoes | NaN | C50 - Mama | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA QUADRANTE SUPERIOR EXTERNO DA | NaN | NaN | NaN | NaN | NaN | Não | NaN | Sim | NaN | Não | NaN |
| 6 | 1843 | Registro De Tumores | NaN | 1.0 | NaN | 2008-12-07 | NaN | 2008-07-27 | NaN | C509 | NaN | 85003.0 | NaN | IIA | NaN | II | NaN | 1C | NaN | 1 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2009-01-25 | NaN | Quimioterapia | NaN | 2008.0 | NaN | não se aplica | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA SOE (EXCLUI PELE DA MAMA C44.5) | NaN | 1 | NaN | 1C | NaN | Não | NaN | Não | NaN | Não | NaN |
| 7 | 1873 | Registro De Tumores | NaN | 1.0 | NaN | 2008-12-08 | NaN | 2008-08-30 | NaN | C509 | NaN | 85003.0 | NaN | IIB | NaN | II | NaN | 2 | NaN | 1 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2008-12-12 | NaN | Outras combinações | NaN | 2008.0 | NaN | Esquerda | NaN | 2016-02-29 | NaN | 2739.0 | NaN | C71 - Encefalo | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA SOE (EXCLUI PELE DA MAMA C44.5) | NaN | NaN | NaN | NaN | NaN | Sim | NaN | Não | NaN | Não | NaN |
| 8 | 1898 | Registro De Tumores | NaN | 1.0 | NaN | 2008-08-23 | NaN | 2008-06-20 | NaN | C509 | NaN | 85003.0 | NaN | IV | NaN | IV | NaN | 4 | NaN | 2 | NaN | 1 | NaN | C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | NaN | C22 - FÃgado e Das Vias Biliares Intra-hepáticas | NaN | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | NaN | NaN | NaN | 2008-10-21 | NaN | Quimioterapia | NaN | 2008.0 | NaN | Esquerda | NaN | 2009-08-14 | NaN | 420.0 | NaN | C71 - Encefalo | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA SOE (EXCLUI PELE DA MAMA C44.5) | NaN | NaN | NaN | NaN | NaN | Não | NaN | Sim | NaN | Não | NaN |
| 9 | 1960 | Registro De Tumores | NaN | 1.0 | NaN | 2009-01-30 | NaN | 2008-07-28 | NaN | C509 | NaN | 85003.0 | NaN | IIIA | NaN | III | NaN | 3 | NaN | 2 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2009-01-30 | NaN | Outras combinações | NaN | 2008.0 | NaN | Direita | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA SOE (EXCLUI PELE DA MAMA C44.5) | NaN | NaN | NaN | NaN | NaN | Não | NaN | Não | NaN | Não | NaN |
| record_id | repeat_instrument_1 | repeat_instrument_2 | repeat_instance_1 | repeat_instance_2 | data_da_primeira_consulta_institucional_dt_pci_1 | data_da_primeira_consulta_institucional_dt_pci_2 | data_do_diagnostico_1 | data_do_diagnostico_2 | codigo_da_topografia_cid_o_1 | codigo_da_topografia_cid_o_2 | codigo_da_morfologia_de_acordo_com_o_cid_o_1 | codigo_da_morfologia_de_acordo_com_o_cid_o_2 | estadio_clinico_1 | estadio_clinico_2 | grupo_de_estadio_clinico_1 | grupo_de_estadio_clinico_2 | classificacao_tnm_clinico_t_1 | classificacao_tnm_clinico_t_2 | classificacao_tnm_clinico_n_1 | classificacao_tnm_clinico_n_2 | classificacao_tnm_clinico_m_1 | classificacao_tnm_clinico_m_2 | metastase_ao_diagnostico_cid_o_1_1 | metastase_ao_diagnostico_cid_o_1_2 | metastase_ao_diagnostico_cid_o_2_1 | metastase_ao_diagnostico_cid_o_2_2 | metastase_ao_diagnostico_cid_o_3_1 | metastase_ao_diagnostico_cid_o_3_2 | metastase_ao_diagnostico_cid_o_4_1 | metastase_ao_diagnostico_cid_o_4_2 | data_do_tratamento_1 | data_do_tratamento_2 | combinacao_dos_tratamentos_realizados_no_hospital_1 | combinacao_dos_tratamentos_realizados_no_hospital_2 | ano_do_diagnostico_1 | ano_do_diagnostico_2 | lateralidade_do_tumor_1 | lateralidade_do_tumor_2 | data_de_recidiva_1 | data_de_recidiva_2 | tempo_desde_o_diagnostico_ate_a_primeira_recidiv_1 | tempo_desde_o_diagnostico_ate_a_primeira_recidiv_2 | local_de_recidiva_a_distancia_metastase_1_cid_o_topografia_1 | local_de_recidiva_a_distancia_metastase_1_cid_o_topografia_2 | local_de_recidiva_a_distancia_metastase_2_cid_o_topografia_1 | local_de_recidiva_a_distancia_metastase_2_cid_o_topografia_2 | local_de_recidiva_a_distancia_metastase_3_cid_o_topografia_1 | local_de_recidiva_a_distancia_metastase_3_cid_o_topografia_2 | local_de_recidiva_a_distancia_metastase_4_cid_o_topografia_1 | local_de_recidiva_a_distancia_metastase_4_cid_o_topografia_2 | descricao_da_morfologia_de_acordo_com_cid_o_1 | descricao_da_morfologia_de_acordo_com_cid_o_2 | descricao_da_topografia_1 | descricao_da_topografia_2 | classificacao_tnm_patologico_n_1 | classificacao_tnm_patologico_n_2 | classificacao_tnm_patologico_t_1 | classificacao_tnm_patologico_t_2 | com_recidiva_a_distancia_1 | com_recidiva_a_distancia_2 | com_recidiva_regional_1 | com_recidiva_regional_2 | com_recidiva_local_1 | com_recidiva_local_2 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4262 | 82100 | Registro De Tumores | NaN | 1.0 | NaN | 2020-07-24 | NaN | 2020-07-24 | NaN | C504 | NaN | 85003.0 | NaN | IIIB | NaN | NaN | NaN | 4B | NaN | 1 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2020-10-07 | NaN | Cirurgia + Radio + Quimio | NaN | 2020.0 | NaN | Esquerda | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA QUADRANTE SUPERIOR EXTERNO DA | NaN | NaN | NaN | NaN | NaN | Não | NaN | Não | NaN | Não | NaN |
| 4263 | 82111 | Registro De Tumores | NaN | 1.0 | NaN | 2020-08-09 | NaN | 2020-06-27 | NaN | C504 | NaN | 85002.0 | NaN | 0 | NaN | NaN | NaN | IS | NaN | 0 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2020-12-06 | NaN | Outras combinações | NaN | 2020.0 | NaN | Direita | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA INTRADUCTAL NAO INFILTRANTE SOE | NaN | MAMA QUADRANTE SUPERIOR EXTERNO DA | NaN | NaN | NaN | NaN | NaN | Não | NaN | Não | NaN | Não | NaN |
| 4264 | 82112 | Registro De Tumores | NaN | 1.0 | NaN | 2020-09-08 | NaN | 2020-09-29 | NaN | C505 | NaN | 85003.0 | NaN | IIIA | NaN | NaN | NaN | 3 | NaN | 1 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2020-11-23 | NaN | Outras combinações | NaN | 2020.0 | NaN | Direita | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA QUADRANTE INFERIOR EXTERNO DA | NaN | NaN | NaN | NaN | NaN | Não | NaN | Não | NaN | Não | NaN |
| 4265 | 82118 | Registro De Tumores | NaN | 1.0 | NaN | 2020-01-28 | NaN | 2020-02-27 | NaN | C509 | NaN | 85003.0 | NaN | IA | NaN | NaN | NaN | 1C | NaN | 0 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2020-05-26 | NaN | Outras combinações | NaN | 2020.0 | NaN | Direita | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA SOE (EXCLUI PELE DA MAMA C44.5) | NaN | NaN | NaN | NaN | NaN | Não | NaN | Não | NaN | Não | NaN |
| 4266 | 82122 | Registro De Tumores | NaN | 1.0 | NaN | 2020-11-04 | NaN | 2020-07-06 | NaN | C509 | NaN | 85003.0 | NaN | IIA | NaN | NaN | NaN | 2 | NaN | 0 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2020-12-03 | NaN | Outras combinações | NaN | 2020.0 | NaN | Esquerda | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA SOE (EXCLUI PELE DA MAMA C44.5) | NaN | NaN | NaN | NaN | NaN | Não | NaN | Não | NaN | Não | NaN |
| 4267 | 82123 | Registro De Tumores | Registro De Tumores | 1.0 | 2.0 | 2020-12-04 | 2020-12-04 | 2020-10-10 | 2020-10-10 | C504 | C509 | 85003.0 | 85003.0 | IIB | IIA | NaN | NaN | 3 | 2 | 0 | 0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2020-12-14 | 2020-12-14 | Outras combinações | Outras combinações | 2020.0 | 2020.0 | Direita | Esquerda | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | CARCINOMA DUCTAL INFILTRANTE SOE | MAMA QUADRANTE SUPERIOR EXTERNO DA | MAMA SOE (EXCLUI PELE DA MAMA C44.5) | NaN | NaN | NaN | NaN | Não | Não | Não | Não | Não | Não |
| 4268 | 82124 | Registro De Tumores | Registro De Tumores | 1.0 | 2.0 | 2020-06-20 | 2020-06-20 | 2020-09-05 | 2020-09-05 | C509 | C509 | 85203.0 | 85002.0 | IV | 0 | NaN | NaN | 4D | CDIS | 1 | 0 | 1 | 0 | C38 - Coração, Mediastino e Pleura, | NaN | C22 - FÃgado e Das Vias Biliares Intra-hepáticas | NaN | C34 - Bronquios e Pulmoes | NaN | C41 - Ossos e Das Cartilagens Articulares de Outras Localizações | NaN | 2021-01-12 | 2021-01-12 | Quimioterapia | Quimioterapia | 2020.0 | 2020.0 | Esquerda | Direita | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA LOBULAR SOE | CARCINOMA INTRADUCTAL NAO INFILTRANTE SOE | MAMA SOE (EXCLUI PELE DA MAMA C44.5) | MAMA SOE (EXCLUI PELE DA MAMA C44.5) | NaN | NaN | NaN | NaN | Não | Não | Não | Não | Não | Não |
| 4269 | 82131 | Registro De Tumores | NaN | 1.0 | NaN | 2020-11-01 | NaN | 2019-12-23 | NaN | C502 | NaN | 85203.0 | NaN | IIIA | NaN | NaN | NaN | 3 | NaN | 1 | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2020-12-23 | NaN | Cirurgia + Radioterapia | NaN | 2020.0 | NaN | Direita | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA LOBULAR SOE | NaN | MAMA QUADRANTE SUPERIOR INTERNO DA | NaN | NaN | NaN | NaN | NaN | Não | NaN | Não | NaN | Não | NaN |
| 4270 | 82205 | Registro De Tumores | NaN | 1.0 | NaN | 2021-02-28 | NaN | 2020-11-07 | NaN | C504 | NaN | 85003.0 | NaN | IV | NaN | NaN | NaN | 4D | NaN | 1 | NaN | 1 | NaN | C71 - Encefalo | NaN | C77 - Secundária e Não Especificada Dos Gânglios Linfáticos | NaN | C49 - Tecido Conjuntivo e de Outros Tecidos Moles | NaN | NaN | NaN | 2021-03-27 | NaN | Cirurgia + Radio + Quimio | NaN | 2020.0 | NaN | Esquerda | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA QUADRANTE SUPERIOR EXTERNO DA | NaN | NaN | NaN | NaN | NaN | Não | NaN | Não | NaN | Sim | NaN |
| 4271 | 82240 | Registro De Tumores | NaN | 1.0 | NaN | 2020-12-08 | NaN | 2020-03-14 | NaN | C505 | NaN | 85003.0 | NaN | IIIC | NaN | NaN | NaN | 2 | NaN | 3A | NaN | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2021-01-12 | NaN | Outras combinações | NaN | 2020.0 | NaN | Direita | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | CARCINOMA DUCTAL INFILTRANTE SOE | NaN | MAMA QUADRANTE INFERIOR EXTERNO DA | NaN | NaN | NaN | NaN | NaN | Não | NaN | Não | NaN | Não | NaN |